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Abstract 



We study a heavy piston of mass M that moves in one dimension. The piston 
separates two gas chambers, each of which contains finitely many ideal, unit mass 
gas particles moving in d dimensions, where d > 1. Using averaging techniques, we 
prove that the actual motions of the piston converge in probability to the predicted 
averaged behavior on the time scale M 1 / 2 when M tends to infinity while the total 
energy of the system is bounded and the number of gas particles is fixed. Neishtadt 
and Sinai previously pointed out that an averaging theorem due to Anosov should 
extend to this situation. 

When d — 1, the gas particles move in just one dimension, and we prove 
that the rate of convergence of the actual motions of the piston to its averaged 
behavior is 0(M -1 / 2 ) on the time scale M 1 / 2 . The convergence is uniform over 
all initial conditions in a compact set. We also investigate the piston system when 
the particle interactions have been smoothed. The convergence to the averaged 
behavior again takes place uniformly, both over initial conditions and over the 
amount of smoothing. 

In addition, we prove generalizations of our results to iV pistons separating 
N + 1 gas chambers. We also provide a general discussion of averaging theory 
and the proofs of a number of previously known averaging results. In particular, 
we include a new proof of Anosov's averaging theorem for smooth systems that is 
primarily due to Dolgopyat. 
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Chapter 1 
Introduction 



What can be rigorously understood about the nonequilibrium dynamics of chaotic, 
many particle systems? Although much progress has been made in understanding 
the infinite time behavior of such systems, our understanding on finite time scales 
is still far from complete. Systems of many particles contain a large number of 
degrees of freedom, and it is often impractical or impossible to keep track of their 
full dynamics. However, if one is only interested in the evolution of macroscopic 
quantities, then these variables form a small subset of all of the variables. The evo- 
lution of these quantities does not itself form a closed dynamical system, because 
it depends on events happening in all of the (very large) phase space. We must 
therefore develop techniques for describing the evolution of just a few variables in 
phase space. Such descriptions are valid on limited time scales because a large 
amount of information about the dynamics of the full system is lost. However, the 
time scales of validity can often be long enough to enable a good prediction of the 
observable dynamics. 

Averaging techniques help to describe the evolution of certain variables in 
some physical systems, especially when the system has components that move on 
different time scales. The primary results of this thesis involve applying averaging 
techniques to chaotic microscopic models of gas particles separated by an adiabatic 
piston for the purposes of justifying and understanding macroscopic laws. 

This thesis is organized as follows. In Section 11.11 we briefly introduce the the 
adiabatic piston problem and our results. In Section 11.21 we review the physical 
motivations for our results. The following three chapters may each be read inde- 
pendently. Chapter [2] presents an introduction to averaging theory and the proofs 
of a number of averaging theorems for smooth systems that motivate our later 
proofs for the piston problem. Chapter [3] contains our results for piston systems 
in one dimension, and Chapter H] contains our results for the piston system in 
dimensions two and three. 
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1.1 The adiabatic piston 



Consider the following simple model of an adiabatic piston separating two gas 
containers: A massive piston of mass M 3> 1 divides a container in M d , d = 
1,2, or 3, into two halves. The piston has no internal degrees of freedom and can 
only move along one axis of the container. On either side of the piston there are a 
finite number of ideal, unit mass, point gas particles that interact with the walls 
of the container and with the piston via elastic collisions. When M = oo, the 
piston remains fixed in place, and each gas particle performs billiard motion at a 
constant energy in its sub- container. We make an ergodicity assumption on the 
behavior of the gas particles when the piston is fixed. Then we study the motions 
of the piston when the number of gas particles is fixed, the total energy of the 
system is bounded, but M is very large. 

Heuristically, after some time, one expects the system to approach a steady 
state, where the energy of the system is equidistributed amongst the particles 
and the piston. However, even if we could show that the full system is ergodic, 
an abstract ergodic theorem says nothing about the time scale required to reach 
such a steady state. Because the piston will move much slower than a typical gas 
particle, it is natural to try to determine the intermediate behavior of the piston 
by averaging techniques. By averaging over the motion of the gas particles on a 
time scale chosen short enough that the piston is nearly fixed, but long enough 
that the ergodic behavior of individual gas particles is observable, we will show 
that the system does not approach the expected steady state on the time scale 
M 1 / 2 . Instead, the piston oscillates periodically, and there is no net energy transfer 
between the gas particles. 

The results of this thesis follow earlier work by Neishtadt and Sinai |Sin99t 
INS04] . They determined that for a wide variety of Hamiltonians for the gas par- 
ticles, the averaged behavior of the piston is periodic oscillation, with the piston 
moving inside an effective potential well whose shape depends on the initial po- 
sition of the piston and the gas particles' Hamiltonians. They pointed out that 
an averaging theorem due to Anosov |Ano6 0, LM88], proved for smooth systems, 
should extend to this case. The main result of the present work, Theorem 14.1.14 
is that Anosov's theorem does extend to the particular gas particle Hamiltonian 
described above. Thus, if we examine the actual motions of the piston with respect 
to the slow time r = t/M 1 ^ 2 , then, as M — > oo, in probability (with respect to 
Liouville measure) most initial conditions give rise to orbits whose actual motion is 
accurately described by the averaged behavior for < r < 1, i.e. for < t < M 1 / 2 . 

A recent study involving some similar ideas by Chernov and Dolgopyat [CD06aJ 
considered the motion inside a two-dimensional domain of a single heavy, large gas 
particle (a disk) of mass M ^> 1 and a single unit mass point particle. They as- 
sumed that for each fixed location of the heavy particle, the light particle moves 
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inside a dispersing (Sinai) billiard domain. By averaging over the strongly hy- 
perbolic motions of the light particle, they showed that under an appropriate 
scaling of space and time the limiting process of the heavy particle's velocity is a 
(time-inhomogeneous) Brownian motion on a time scale 0(M 1 / 2 ). It is not clear 
whether a similar result holds for the piston problem, even for gas containers with 
good hyperbolic properties, such as the Bunimovich stadium. In such a container 
the motion of a gas particle when the piston is fixed is only nonuniformly hyper- 
bolic because it can experience many collisions with the flat walls of the container 
immediately preceding and following a collision with the piston. 

The present work provides a weak law of large numbers, and it is an open 
problem to describe the sizes of the deviations for the piston problem [CD06b|. 
Although our result does not yield concrete information on the sizes of the devia- 
tions, it is general in that it imposes very few conditions on the shape of the gas 
container. Most studies of billiard systems impose strict conditions on the shape 
of the boundary, generally involving the sign of the curvature and how the corners 
are put together. The proofs in this work require no such restrictions. In particu- 
lar, the gas container can have cusps as corners and need satisfy no hyperbolicity 
conditions. 

If the piston divides a container in R 2 or M. 3 with axial symmetry, such as a 
rectangle or a cylinder, then our ergodicity assumption on the behavior of the gas 
particles when the piston is fixed does not hold. In this case, the interactions of the 
gas particles with the piston and the ends of the container are completely specified 
by their motions along the normal axis of the container. Thus, this system projects 
onto a system inside an interval consisting of a massive point particle, the piston, 
which interacts with the gas particles on either side of it. These gas particles make 
elastic collisions with the walls at the ends of the container and with the piston, 
but they do not interact with each other. For such one-dimensional containers, 
the effects of the gas particles are quasi-periodic and can be essentially decoupled, 
and we recover a strong law of large numbers with a uniform rate, reminiscent 
of classical averaging over just one fast variable in S 1 : The convergence of the 
actual motions to the averaged behavior is uniform over all initial conditions, with 
the size of the deviations being no larger than 0(M~ 1 ^ 2 ) on the time scale M +1 / 2 . 
See Theorem 13. 1.1 1 Gorelyshev and Neishtadt [GNOS] independently obtained this 
result. 

For systems in d = 1 dimension, we also investigate the behavior of the system 
when the interactions of the gas particles with the walls and the piston have been 
smoothed, so that Anosov's theorem applies directly. Let 5 > be a parameter 
of smoothing, so that 5 = corresponds to the hard core setting above. Then 
the averaged behavior of the piston is still a periodic oscillation, which depends 
smoothly on 5. We show that the deviations of the actual motions of the piston 
from the averaged behavior are again not more than 0{M^ 1 ^ 2 ) on the time scale 
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M 1 / 2 . The size of the deviations is bounded uniformly, both over initial conditions 
and over the amount of smoothing, Theorem 13.1.21 

Our results for a single heavy piston separating two gas containers generalize 
to the case of N heavy pistons separating N + 1 gas containers. Here the averaged 
behavior of the pistons has them moving like an N- dimensional particle inside an 
effective potential well. Compare Section 13.1.31 

The systems under consideration in this work are simple models of an adiabatic 
piston. The general adiabatic piston problem [Cal63] , well-known from physics, 
consists of the following: An insulating piston separates two gas containers, and 
initially the piston is fixed in place, and the gas in each container is in a separate 
thermal equilibrium. At some time, the piston is no longer externally constrained 
and is free to move. One hopes to show that eventually the system will come to a 
full thermal equilibrium, where each gas has the same pressure and temperature. 
Whether the system will evolve to thermal equilibrium and the interim behavior of 
the piston are mechanical problems, not adequately described by thermodynam- 
ics |Gru99j . that have recently generated much interest within the physics and 
mathematics communities following Lieb's address [Lie99j . One expects that the 
system will evolve in at least two stages. First, the system relaxes deterministically 
toward a mechanical equilibrium, where the pressures on either side of the piston 
are equal. In the second, much longer, stage, the piston drifts stochastically in the 
direction of the hotter gas, and the temperatures of the gases equilibrate. See for 
example [GPL031 ICL021 IChe04] and the references therein. Previously, rigorous 
results have been limited mainly to models where the effects of gas particles rec- 
olliding with the piston can be neglected, either by restricting to extremely short 
time scales [LSC02, ICLS02] or to infinite gas containers [Chc04j. 

1.2 Physical motivation for the results 

In this section, we briefly review the physical motivations for our results on the 
adiabatic piston. 

Consider a massive, insulating piston of mass M that separates a gas container 
T> in M. d , d = 1,2, or 3. See Figure 11.11 Denote the location of the piston by Q 
and its velocity by dQ/dt = V. HQ is fixed, then the piston divides T> into two 
subdomains, T>i(Q) = T>\ on the left and T> 2 (Q) = T>% on the right. By \D^\ we 
denote the area (when d = 2, or length, when d = 1, or volume, when d = 3) of 
Pj. Define 

_ 0|Pi(Q)l d\V 2 (Q)\ 
dQ dQ ' 

so that i is the piston's cross-sectional length (when d — 2, or area, when d — 3). 
If d — 1, then I = 1. By E{ we denote the total energy of the gas inside T>i. 
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V = V 1 (Q)UV 2 (Q) 



M = e~ 2 > 1 




Figure 1.1: A gas container V in d = 2 dimensions separated by an adiabatic 
piston. 

We are interested in the dynamics of the piston when the system's total energy 
is bounded and M — > oo. When M = oo, the piston remains fixed in place, and 
each energy remains constant. When M is large but finite, MV 2 /2 is bounded, 
and so V — 0{M~ 1 / 2 ). It is natural to define 

e = M-V2, 

w = i:, 

£ 

so that W is of order 1 as e — > 0. This is equivalent to scaling time by £, and so 
we introduce the slow time 

r = et. 

If we let Pi denote the pressure of the gas inside T>i, then heuristically the 
dynamics of the piston should be governed by the following differential equation: 

i.e. (1.1) 
dr dr 

To find differential equations for the energies of the gases, note that in a short 
amount of time dt, the change in energy should come entirely from the work done 
on a gas, i.e. the force applied to the gas times the distance the piston has moved, 
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because the piston is adiabatic. Thus, one expects that 



dE l 
~df 



dE l 
dr 



-vp 1 e, 

i.e. 

-WPJ, 



dE 2 
dt 



dE 2 
dr 



+VP 2 £, 



+WP 2 i 



1.2) 



To obtain a closed system of differential equations, it is necessary to insert an 
expression for the pressures. Pit should be the average force from the gas particles 
in T>i experienced by the piston when it is held fixed in place. Whether such an 
expression, depending only on Ei and T>i(Q), exists and is the same for (almost) 
every initial condition of the gas particles depends strongly on the microscopic 
model of the gas particle dynamics. Sinai and Neishtadt [Sin99l iNSOlj pointed 
out that for many microscopic models where the pressures are well defined, the 
solutions of Equations fll.ip and (11.21) have the piston moving according to a model- 
dependent effective Hamiltonian. 

Because the pressure of an ideal gas in d dimensions is proportional to the 
energy density, with the constant of proportionality 2/d, we choose to insert 

2Ei 



d\Vi 



Later, we will make assumptions on the microscopic gas particle dynamics to justify 
this substitution. However, if we accept this definition of the pressure, we obtain 
the following ordinary differential equations for the four macroscopic variables of 
the system: 

W 

2E x i 2E 2 £ 



d 

d^ 



Q 
w 

E 1 
E 2 



d\V x {Q)\ d\V 2 {Q)\ 
2WE 1 i 



d|2M<9)l 
2WE 2 £ 



1.3) 



d\V 2 {Q)\ 

For these equations, one can see the effective Hamiltonian as follows. Since 

d\n(E t ) 2dln(|Z>i(Q)|) 



d dr 
XMQ(0))|V /d 



Hence 



dr 

d 2 Q(r) 2£E 1 (0) |I?i(g(0)) | 2/d 2£ E 2 (0) \V 2 (Q(0))\ 2/d 



dr 2 



d 



|^i(Q(r))| 1+2/d 



d 



\V 2 (Q(r))\ 1+2/d 
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effective potential 




— i -Q 

Pi = P 2 



Figure 1.2: An effective potential. 

and so (Q, W) behave as if they were the coordinates of a Hamiltonian system 
describing a particle undergoing motion inside a potential well. The effective 
Hamiltonian may be expressed as 

1 2 iS 1 (0)|P 1 (Q(0))| 2/ ' t | E 2 (0)\V 2 (Q(0))\ 2/d n4) 

2 |?MQ)| 2/d \V 2 (Q)\ 2/d ' 

The question is, do the solutions of Equation (11.31) give an accurate description 
of the actual motions of the macroscopic variables when M tends to infinity? The 
main result of this thesis, Theorem 14.1.11 is that, for an appropriately defined 
system, the answer to this question is affirmative for < t < M 1 / 2 , at least for 
most initial conditions of the microscopic variables. Observe that one should not 
expect the description to be accurate on time scales much longer than 0(M 1 / 2 ) = 
0(s^ 1 ). The reason for this is that, presumably, there are corrections of size 0(s) 
in Equation (11.31) that we are neglecting. For r = et > 0(1), these corrections 
should become significant. Such higher order corrections for the adiabatic piston 
were studied by Crosignani et al. [CDP S96J. 
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Chapter 2 

Background Averaging Material 



In this chapter, we present a number of well-known classical averaging results 
for smooth systems, as well as a proof of Anosov's averaging theorem, which is 
the first general multi-phase averaging result. All of these theorems are at least 
45 years old. However, we present them here because our proofs of the classical 
results are at least slightly novel, and the ideas in them lend themselves well to 
certain higher- dimensional generalizations. In particular, they are fairly close to 
the ideas in the proof we give for our piston results in one dimension. The proof of 
Anosov's theorem is a new and unpublished proof due mainly to Dolgopyat, with 
some further simplifications made. The ideas in this proof underly the ideas we 
will use to prove the weak law of large numbers for our piston system in dimensions 
two and three. 

We begin by giving a discussion of a framework for general averaging theory 
and some averaging results. A number of classical averaging theorems are then 
proved, followed by the proof of Anosov's theorem. 



2.1 The averaging framework 

In this section, consider a family of ordinary differential equations 

dz 

Tt =z{z ' £) {2A) 

on a smooth, finite-dimensional Riemannian manifold Ai, which is indexed by the 
real parameter e G [0, Sq\. Assume 

• Regularity: the functions Z and dZ/de are both C 1 on M x [0,£ ]- 

We denote the flow generated by Z(-,e) by z £ (t,z) = z £ {t). We will usually 
suppress the dependence on the initial condition z = z £ (0,z). Think of z e {-) 
as being a random variable whose domain is the space of initial conditions for 
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the differential equation (12.11) and whose range is the space of continuous paths 
(depending on the parameter t) in A4. 

• Existence of smooth integrals: z (t) has m independent C 2 first integrals 



Then h is conserved by Zo(t), and at every point the linear operator dh/dz has 
full rank. It follows from the implicit function theorem that each level set 



is a smooth submanifold of co-dimension m, which is invariant under Zo(t). Fur- 
ther, assume that there exists an open ball U C M m satisfying: 

• Compactness: Vc G U, M c is compact. 

• Preservation of smooth measures: Vc G U, zo(t)\M c preserves a smooth 
measure /i c that varies smoothly with c, i.e. there exists a C l function 
g : M. — > M>o such that g\jvi c * s the density of /i c with respect to the 
restriction of Riemannian volume. 



Again, think of h £ (-) as being a random variable that takes initial conditions z G A4 
to continuous paths (depending on the parameter t) in U. Since dho/dt = 0, 
Hadamard's Lemma allows us to write 




= eH(z £ ,e) 



for some C 1 function H : M. x [0, So) — > U. Observe that 



h = (h 1 ,...,h m ) : M —>W 



M c := {h = c} 



Set 



h e (t,z) = h e (t) :=h(z £ (t)). 



dh £ 
dt 



(t) = Dh(z £ (t))Z(z £ (t),e) 



Dh(z £ (t))(Z(z £ (t),e) - Z(z e (t),0)) 



so that 



H(z,0)=Cezi n h. 



Here C denotes the Lie derivative. 

Define the averaged vector field H by 




(2.2) 



Then H is C . Fix a compact set VcM, and introduce the slow time 



r = et. 
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Let h(r, z) = h(r) be the random variable that is the solution of 

dh - - 

- = H{h), h(0) = h £ (o). 

We only consider the dynamics in a compact subset of phase space, so for initial 
conditions z G h~ l U, define the stopping time 

T £ (z) = T £ = inf{r > : h(r) £ V or h £ (r/e) £ V}. 

Heuristically, think of the phase space M. as being a fiber bundle whose base 
is the open set U and whose fibers are the compact sets M. h . See Figure 12.11 
Then the vector field Z(-,0) is perpendicular to the base, so its orbits z (t) flow 
only along the fibers. Now when < £ < 1, the vector field Z(-,e) acquires a 
component of size 0(e) along the base, and so its orbits z £ (t) have a small drift 
along the base, which we can follow by observing the evolution of h £ (t). Because 
of this, we refer to h as consisting of the slow variables. Other variables, used 
to complete h to a parameterization of (a piece of) phase space, are called fast 
variables. Note that h £ (t) depends on all the dimensions of phase space, and so it 
is not the flow of a vector field on the m-dimensional space 14. However, because 
the motion along each fiber is relatively fast compared to the motion across fibers, 
we hope to be able to average over the fast motions and obtain a vector field 
on U that gives a good description of h £ (t) over a relatively long time interval, 
independent of where the solution z £ (t) started on A4h £ (o)- Because our averaged 
vector field, as defined by Equation (12.21) . only accounts for deviations of size 0(e), 
we cannot expect this time interval to be longer than size 0(1/ e). In terms of 
the slow time r = et, this length becomes 0(1). In other words, the goal of the 
first-order averaging method described above should be to show that, in some 
sense, sup 0<T<1ATe \h £ (r/e) — h(r)\ — > as e — > 0. This is often referred to as the 
averaging principle. 

Note that the assumptions of regularity, existence of smooth integrals, com- 
pactness, and preservation of smooth measures above are not sufficient for the 
averaging principle to hold in any form. As an example of just one possible 
obstruction, the level sets M. c could separate into two completely disjoint sets, 
M. c = M.^ U M.~ . If this were the case, then it would be implausible that the 
solutions of the averaged vector field defined by averaging over all of M. c would 
accurately describe h £ (t,z), independent of whether z G M.^ or z G M.~ . 

Some averaging results 

So far, we are in a general averaging setting. Frequently, one also assumes that the 
invariant submanifolds, Aih, are tori, and that there exists a choice of coordinates 

z = (h,(p) 
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Figure 2.1: A schematic of the phase space M.. Note that although the level set 
M-c = {h = c} is depicted as a torus, it need not be a torus. It could be any 
compact, co-dimension m sub manifold. 

on M. in which the differential equation (12 .ip takes the form 

^ = eH(h,<p,e), ^ = $(/i^,e). 

Then if ip G S 1 and the differential equation for the fast variable is regular, 
i.e. <p, 0) is bounded away from zero for h G U, 

sup sup \h £ (r/e) — h(r) \ = 0(e) as e — > 0. 

initial conditions 0<t<1AT £ 

s.t. h e {o)ev 

See for example Chapter 5 in |SV85j , Chapter 3 in |LM88j , or Theorem 12.2.31 in 
the following section. 

When the differential equation for the fast variable is not regular, or when 
there is more than one fast variable, the typical averaging result becomes much 
weaker than the uniform convergence above. For example, consider the case when 
(p G T", n > 1, and the unperturbed motion is quasi-periodic, i.e. $(/t, <p, 0) = 
Q(h). Also assume that H G C n+2 and that Q is nonvanishing and satisfies a 
nondegeneracy condition on U (for example, Q : U —>■ T n is a submersion). Let P 
denote Riemannian volume on A4. Neishtadt [LM88t INei76 ] showed that in this 
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situation, for each fixed 5 > 0, 



P[ sup \h e {r/e) -h(r)\ > 8 



,0<r<lAT e 



) 



and that this result is optimal. Thus, the averaged equation only describes the 
actual motions of the slow variables in probability on the time scale 1/e as e — > 0. 

Neishtadt's result was motivated by a general averaging theorem for smooth 
systems due to Anosov. This theorem requires none of the additional assumptions 
in the averaging results above. Under the conditions of regularity, existence of 
smooth integrals, compactness, and preservation of smooth measures, as well as 

• Ergodicity: for Lebesgue almost every cGW, (;z (-),/i c ) is ergodic, 

Anosov showed that sup 0<r<1AT£ \h e (r/e) — h(r)\ — » in probability (w.r.t. Rie- 
mannian volume on initial conditions) as e — > 0, i.e. 

Theorem 2.1.1 (Anosov's averaging theorem |Ano60j ). For each T > and for 
each fixed 5 > 0, 



as e — > 0. 

We present a recent proof of this theorem in Section 12.31 below. 

If we consider h £ (-) and h(-) to be random variables, Anosov's theorem is a 
version of the weak law of large numbers. In general, we can do no better: There 
is no general strong law in this setting. There exists a simple example due to 
Neishtadt (which comes from the equations for the motion of a pendulum with 
linear drag being driven by a constant torque) where for no initial condition in a 
positive measure set do we have convergence of h e (t) to h(et) on the time scale 1/e 
as e — ► [Kif 04b] . Here, the phase space is R x S l , and the unperturbed motion 
is (uniquely) ergodic on all but one fiber. 

2.2 Some classical averaging results 

In this section we present some simple, well-known averaging results. See for 
example Chapter 5 in |SV85j or Chapter 3 in [LM88J. 




sup 

0<t<TAT e 
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2.2.1 Averaging for time-periodic vector fields 

Consider a family of time dependent ordinary differential equations 

^ = eH(h,t,e), (2.3) 

indexed by the real parameter e > 0, where h G M m . Fix V CC U C W 71 , and 
suppose 

• Regularity: H G C l {U x K x [0, oo)). 

• Periodicity: There exists T > such that for each h G U, H(h,t,0) is 
T-periodic in time. 

Then 

^=eH(h,t,0) + O(e 2 ). 

Let /i e (£) denote the solution of Equation (12. 3p . We seek a time independent 
vector field whose solutions approximate h e (t), at least for a long length of time. 
It is natural to define the averaged vector field H by 

r 



i r 

H(h) = — H(h,s,0)ds. 
* Jo 



Then H G C l (U). Let h(r) be the solution of 

^ = H(h), h(0) = Mo). 

It is reasonable to hope that h(et) and h £ (t) are close together for < t < 
We only consider the dynamics in a compact subset of phase space, so for initial 
conditions in U, we define the stopping time 

T £ = inf{r > : h(r) £ V or h e {r/e) £ V}. 

Theorem 2.2.1 (Time-periodic averaging). For each T > 0, 

sup sup \h £ (r/e) — h(r)\ = 0(e) as e — ► 0. 

h e (o)ev 0<t<TAT e 
Proof. We divide our proof into three essential steps. 
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Step 1: Reduction using Gronwall's Inequality. Now, h(r) satisfies the 
integral equation 

h( T ) - h(0) = [ H(h(a))da, 
Jo 

while h £ {r/e) satisfies 

-r/e 



f-r/e 

h £ (r/e) - h E (0) = e / H(h £ (s), s, e)ds 
Jo 

i-t/e 

= 0(e) + e / H{h e {s),s,0)ds 
Jo 



/■t/e ft 

0(e) + e / H(h e (s),s,0) - H(h £ (s))ds + / H(h £ (a/e))da 
Jo Jo 



for < r < T AT £ . 
Define 



e e (r) = e [ H(h e (s), s, 0) - H(h £ (s))ds. 
Jo 

It follows from Gronwall's Inequality that 

sup \h(r) - h £ (r/e)\ < ( 0(e) + sup |e £ (r)|] e Lip ( 5|v ) T . 

Step 2: A sequence of times adapted for ergodization. Ergodization refers 
to the convergence along an orbit of a function's time average to its space average. 
We define a sequence of times tf. for k > by tk = kT . This sequence of times is 
motivated by the fact that 

k+1 H(h o (s),s,0)ds = H(h ). 



tk+l ~ tk Jt 

Note that h (t) is independent of time. Thus 



sup |e e (r)| < 0(e)+e V / ' * + ' H(h e (s), s, 0) - H(h £ (s))ds 

r<T/\T e Ju 



0<r<TAT £ 

tfc-M < 



. TAT t 

k+1 



■ (2-4) 



Step 3: Control of individual terms by comparison with solutions of the 
e = equation. The sum in Equation (\2A\i has no more than 0(l/e) terms, 
and so it suffices to show that each term f* k+1 H(h £ (s), s, 0) — H(h £ (s))ds is no 
larger than 0(e). We can accomplish this by comparing the motions of h £ (t) for 
tk <t < tk+i with hk, e {t), which is defined to be the solution of the e = ordinary 
differential equation satisfying hk, £ (tk) = h £ (t k ), i.e. h kj£ (t) = h £ {%)- 
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□ 



Lemma 2.2.2. Ift k+1 < ^fk ; then sup tfc < t < tfc+i \h Ke {t) - h £ {t)\ = 0(e). 
Proof. dh £ /dt = 0{e). 

Using that H and H are Lipschitz continuous, we conclude that 

k+1 H(h E (s),s,0)-H(h E (s))ds 

i 

k+1 H(h £ (s),s,0)-H(h k , £ (s),s, 0)ds 

k 

+ / " +1 H(h k , £ (s), s, 0) - H(h k , £ (s))ds 
Jt k 

+ I H(h k , E (s)) - H(h £ (s))ds 
Jt k 

=O(e) + + O(e) 
=0{e). 

Thus we see that sup 0<T<TATe \h £ {r/e) —h(r)\ < 0(e), independent of the 
initial condition h £ (0) e V. 

□ 

Remark 2.2.1. Note that the 0(e) control in Theorem 12.2. II on a time scale t = 
0(s^ 1 ) is generally optimal. For example, take H(h,t,e) = cos(t) + e. 

2.2.2 Averaging for vector fields with one regular fast vari- 
able 

For h G M. m and (p G S 1 = [0, l]/0 ~ 1, consider the family of ordinary differential 
equations 

^^eH(h, V ,e), ^ = <S>( h ,<p,e), (2.5) 

indexed by the real parameter e > 0. With z = (h,(p), we write this family of 
differential equations as dz/dt = Z(z,e). 
Fix V CC U C R m , and suppose 

• Regularity: Z eC l (U x S l x [0, oo)). 

• Regular fast variable: 0) is bounded away from for h G i.e. 

inf |$(/i,p,0)| > 0. 

Without loss of generality, we assume that <&(h, <p, 0) > 0. 
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Let z e (t) = (h £ (t) , <p £ (t)) denote the solution of Equation (12.51) . Then z Q (t) 
leaves invariant the circles M. c = {h = c} in phase space. In fact, z (t) preserves 
an uniquely ergodic invariant probability measure on Ai c , whose density is given 
by 

where K c = L , l dlp n , is a normalization constant. 

The averaged vector field H is defined by averaging H (h, ip, 0) over p: 

H{h)= f H(h,p>,0)d» h (p>) = -L f' ^u^nW 

Jo K h Jo ®i h i ¥i 0) 

Then H e C l (U). Let h{r) be the solution of 

^ = H(h), h(0) = h e (0)- 

For initial conditions in U x S 1 , we have the usual stopping time T £ = inf{r > : 
h(r) i V or he(r/e) £ V}. 

Theorem 2.2.3 (Averaging over one regular fast variable). For each T > 0, 
sup sup \h e {r/e) — h(r)\ = 0(e) as e — » 0. 

initial conditions 0<t<TAT e 
s.t. fc e (0)6V 

Remark 2.2.2. This result encompasses Theorem 12 . 2 . 1 1 for time-periodic averaging. 
For example, if T = 1, simply take <p = t mod 1 and e) = 1. 

Remark 2.2.3. Many of the proofs of the above theorem of which we are aware 
hinge on considering ip as a time-like variable. For example, one could write 

alh dh dt H{h, <p, 0) . 2 . 
dp ~ ~dtdp ~ £ $(h,(p,0) + ^ 

and this looks very similar to the time-periodic situation considered previously. 
However, it does take some work to justify such arguments rigorously, and the 
traditional proofs do not easily generalize to averaging over multiple fast variables. 
Our proof essentially uses p to mark off time, and it will immediately generalize 
to a specific instance of multiphase averaging. 

Proof. Again, we have three steps. 
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Step 1: Reduction using Gronwall's Inequality. Now 

h(r) - h(0) = [ H(h(a))da, 
Jo 

and 

h e (r/e)-h e (0)=e [ ' H(z e (s), e)ds = O(e) + e [ ' H(z £ (s),0)ds 
Jo Jo 

i-t/e pt 

= 0(e)+e / H(z £ (s),0)- H(h £ (s))ds + / H(h £ (a/e))da 
Jo Jo 

for < t < T A 71. 



Define 

rr/e 



e £ (r)=e[ * H(z e (s),0) - H(h £ (s))ds. 
Jo 

It follows from Gronwall's Inequality that 

sup \h(r)-h £ (T/e)\< (o(e)+ sup \e £ (r)\] e Lip ( s ^) T . 

Step 2: A sequence of times adapted for ergodization. Now for each 
initial condition in our phase space and for each fixed e, we define a sequence of 
times t k>£ and a sequence of solutions z k>£ (t) inductively as follows: £ ,e — and 
zo,e(J) = z o(t)- For k > 0, t Ke = mi{t > t k _ h£ : </?fc-i, £ (0 = ¥> e (0)}, and z Ke {t) is 
defined as the solution of 



dzk, £ 



= Z(z k;£ , 0) = (0, $(z fc , e , 0)), z k;£ (t k;£ ) = z £ (t k , £ ). 



dt 

This sequence of times is motivated by the fact that 

■y ftk+i,s 



tk+l,e tk,e Jt k ,e 



H(z k , £ (s),0)ds = H(h k , £ 



Recall that h k , £ {t) is independent of time. The elements of this sequence of times 
are approximately uniformly spaced, i.e. if we fix uo > such that 2; 6 V x S 1 4> 
1/uj < $(z, 0) < u, then if t k+1>£ < (T A T £ )/e, l/uo < t k+1>£ - t Ke < u. 
Thus, 



sup 

0<T<TAT e 



\e £ (r)\ < 0(e) + e £ 



■k+l,e 



t 

" H(z £ (s),0) - H(h e (s))ds 

tk,e 



where the sum in in this equation has no more than 0{l/e) terms. 
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Step 3: Control of individual terms by comparison with solutions along 
fibers. It suffices to show that each term / tfc fc+1,e H(z e (s),0) — H(h £ (s))ds is no 
larger than 0(e). We can accomplish this by comparing the motions of z e (t) for 
tk,e <t < t k+1<£ with z k>£ (t). 

Lemma 2.2.4. Ift k+lt£ < then sup tM < t < t( , +le \z kj£ {t) - z £ (t)\ = 0(e). 

Proof. Without loss of generality, we take k = 0, so that z k>£ (t) = z (t). Since 
ho(t) = h £ (0) and dh £ /dt = 0{e), su Ptoe < t < tM \h {t) - h £ (t)\ = 0{e). 

Now ip e (t) — <^ e (0) = f*$(h e (s),ip e (s),e)ds, and because $ is Lipschitz, we 
find that 

\<Pe(s) ~ <Po(,s)\ds 



\<p e (t) - <p Q (t)\ <0(e) + U P ($) [ 

Jo 

for < t < u. The result follows from Gronwall's Inequality. 

Using that H and H are Lipschitz continuous, we conclude that 

' +1 ' e H(z £ (s),0)-H(h £ (s))ds 

tk,e 

k+1 ' S H(z £ (s),0)-H(z k , £ (s),0)ds 



□ 



k,s 

+ [ k+1 ' e H(z k M,0)-H(h ktE (s))ds 

+ [ L+l E H(h k , £ (s)) - H(h £ {s))ds 

=O(e) + + O(e) 
=0{e). 

Thus we see that sup 0<r<TATe \h £ {r/e) — h(r)\ = 0(e), independent of the 
initial condition (h £ (0),ip £ (0)) G V x S l . 

□ 



2.2.3 Multiphase averaging for vector fields with separa- 
ble, regular fast variables 

As explained in Section I2.1[ when the differential equation for the fast variable is 
not regular, or when there is more than one fast variable, the typical averaging 
result becomes much weaker than the uniform convergence in Theorems 12.2.11 
and 12.2.31 above. Nonetheless, if the differential equations under consideration 
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satisfy some very specific hypotheses, the proof in the previous section immediately 
generalizes to yield uniform convergence. 

For h G R m and ip = (ip 1 , ■ ■ ■ , ip n ) G T n = ([0, l]/0 ~ l) n , consider the family 
of ordinary differential equations 

^ = eH(h,ip,e), ^ = $(h,(p,e), (2.6) 

indexed by the real parameter e > 0. We also write z = (h, ip) and dz/dt = Z(z, e). 
Fix V CC U C R m , and suppose 

• Regularity: Z G C\U x T n x [0, oo)). 

• Separable fast variables: H{h, ip, 0) and 0) have the following specific 
forms: 

— There exist C 1 functions Hj(h, ip') such that H(h, ip, 0) = Yll=i Hj(h, <pi). 
This can be thought of as saying that, to first order in e, each fast vari- 
able affects the slow variables independently of the other fast variables. 

— The components $ J of $ satisfy &(h,ip,0) = $ J (/t, (pi, 0), i.e. the un- 
perturbed motion has each fast variable moving independently of the 
other fast variables. Note that this assumption is satisfied if the unper- 
turbed motion is quasi-periodic, i.e. <&(/i, <p,0) = Q(h). 

• Regular fast variables: For each j, 

inf |$ J '(/i,<^,0)| > 0. 



Let z e (t) = (h £ (t),ip £ (t)) denote the solution of Equation (12.61) . Then z Q (t) 
leaves invariant the tori Ai c = {h = c} in phase space. In fact, Zo(t) preserves a 
(not necessarily ergodic) invariant probability measure on M. c , whose density is 
given by 



1 



dip 



3= 

'1 dtpi 



where K{ = J Q |^ (C) ^ )0)r 

The averaged vector field H is defined by 



3(h) = [ H(h, ip, 0)dfi h (p>) =J2[ Hj(h, ^)dpL h {ip) 
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Let h(r) be the solution of 



— = H(h), h(0) = h £ (0), 
ar 

and the stopping time T £ = inf{r > : h(r) ^ V or h £ (r/e) ^ V}. 

Theorem 2.2.5 (Averaging over multiple separable, regular fast variables). For 

each T > 0, 

sup sup \h £ (r/e) — h(r)\ = O(e) as e — ► 0. 

initial conditions 0<t<TAT e 

s.t. h s (o)eV 

Proof. The proof is essentially the same as the proof of Theorem 12.2.31 As before, 
we need only show that supg^^j^jj. |e e (r)| = 0{e), where 

e £ (r)=e [ ' H(z E (s),0) - H(h e (s))ds. 
Jo 

But by our separability assumptions, it suffices to show that for each j, 

sup |e ij£ (r)| = 0(e), 

0<r<TAT e 

where e 3 - je (r) is defined by 

e h£ (r) = e f ' H^s), <4(s)) - H 3 (h e (s))ds. 
Jo 

Thus, we have effectively separated the effects of each fast variable, and now the 
proof can be completed by essentially following steps 2 and 3 in the proof of 
Theorem 12.2.31 

□ 

2.3 A proof of Anosov's theorem 

Anosov's original proof of Theorem 12.1.11 from 1960 may be found in |Ano 60j . An 
exposition of the theorem and Anosov's proof in English may be found in [LM88J. 
Recently, Kifer [Kif04a] proved necessary and sufficient conditions for the averaging 
principle to hold in an averaged with respect to initial conditions sense. He also 
showed explicitly that his conditions are met in the setting of Anosov's theorem. 
The proof of Anosov's theorem given here is mainly due to Dolgopyat |Dol05j . 
although some further simplifications have been made. 
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Proof of Anosov 's theorem. We begin by showing that without loss of generality 
we may take T £ = oo. This is just for convenience, and not an essential part of 
the proof. To accomplish this, let ip(h) be a smooth bump function satisfying 

• rj}{h) = 1 if h e V, 

• ip(h) > if h G interior(V), 

• rj}{h) = if h i V, 

where V is a compact set chosen such that V CC interior (V) CC W. Next, 
set Z(z,e) = ip(h(z))Z(z,e). Because the bump function was chosen to depend 
only on the slow variables, our assumption about preservation of measures is still 
satisfied; on each fiber, Z(z,0) is a scaler multiple of Z(z,0). Furthermore, the 
flow of Z(-, 0)|x h is ergodic for almost every h e V. Then it would suffice to prove 
our theorem for the vector fields Z(z,e) with the set V replacing V. We assume 
that this reduction has been made, although we will not use it until Step 5 below. 

Step 1: Reduction using Gronwall's Inequality. Observe that h(r) satisfies 
the integral equation 

h( T ) - h(0) = [ H(h(a))da, 
Jo 

while h £ {r/e) satisfies 

h £ (r/e) - h e (0) = e [ ' H(z £ (s),e)ds 
Jo 

= 0(e)+e [ H(z £ (s),0)ds 
Jo 

pr/e pr 

= 0(e)+e H(z £ (s),0) - H(h e (s))ds+ H(h £ (a/e))da 
Jo Jo 

for < t < T A T £ . Here we have used the fact that h^V x [0, e ] is compact to 
achieve uniformity over all initial conditions in the size of the 0{e) term above. We 
use this fact repeatedly in what follows. In particular, H, H, and Z are uniformly 
bounded and have uniform Lipschitz constants on the domains of interest. 
Define 

e e (r) = e [ H(z £ (s), 0) - H(h £ (s))ds. 
Jo 

It follows from Gronwall's Inequality that 
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Step 2: Introduction of a time scale for ergodization. Choose a real-valued 
function L{e) such that L(e) — > oo, L(e) = o(log£ _1 ) as e — > 0. Think of L(e) as 
being a time scale which grows as e — > so that ergodization, i.e. the convergence 
along an orbit of a function's time average to a space average, can take place. 
However, L(e) doesn't grow too fast, so that on this time scale z £ (t) essentially 
stays on one fiber, where we have our ergodicity assumption. Set t^ e = kL(e), so 
that 



TATg 



-1 



sup |e e (r)| < 0(eL(e)) +e 



0<t<TAT, 



k=0 



tfe+i, 



H(z £ (s),0)-H(h £ (s))ds 



tk,e 



(2.8) 



Step 3: A splitting for using the triangle inequality. Now we let Zk, £ (s) 

be the solution of 



dz, 



k.e 



dt 



= Z(z kt£ , 0), Z K e{t K£ ) = Z e (t k , e ). 



Set hk, £ (t) = h(zk, e (t)). Observe that hk, e (t) is independent of t. We break up the 

rtk- 

ftk+l,e 



integral / t * fc+M H(z £ (s), 0) — H(h £ (s))ds into three parts: 



/ 



H(z e (s),0)-H(h £ (s))ds 



*fc+l,e 

iJ(z e ( S ),0)-iJ(^ £ ( S ),0)rf S 
+ / H(z k , £ (s),0)-H(hk, £ (s))ds 

+ / H(h k , £ (s))ds-H(h £ (s))ds 

:=I k , £ + /4, £ + //4, £ . 

The term i7fc )£ represents an "ergodicity term" that can be controlled by our 
assumptions on the ergodicity of the flow zo(t), while the terms Ik, £ and IIIk, £ 
represent "continuity terms" that can be controlled using the following control on 
the drift from solutions along fibers. 



Step 4: Control of drift from solutions along fibers. 
Lemma 2.3.1. If < t k+ i, £ < X/ ^ k , 

sup \z k , £ (t) - z £ (t)\ < 0(eL(e)e L ^ L ^) 



22 



Proof. Without loss of generality we may set k = 0, so that z kt£ (t) = z (t). Then 
for < t < L(e), 



\z Q (t) - z £ (t)\ = [ Z(z (s),0)-Z(z £ (s),e)ds 
Jo 



< Lip (Z) / \e\ + \z (s) — z £ (s)\ ds 
Jo 

= 0(eL(e)) + Up(Z) [ \z (s)-z £ (s)\ds. 

Jo 

The result follows from Gronwall's Inequality. 

From Lemma DH we find that I Ke ,III Ke = 0{eL(e) 2 e u ^ L ^). 



□ 



Step 5: Use of ergodicity along fibers to control IIk, £ - From Equations 
( 12. 71) and ( 12. 8ft and the triangle inequality, we already know that 



sup \h(r) - h £ (r/e)\ 

0<t<TAT £ 



T 



< 0(e) + 0{eL{e)) + e——0{eL{e) 2 e^ z)L{e) ) + O 

£±j\E ) 

= 0(eL(e)e Up(z)L(£) ) + O 



I k=0 



/ m- 1 \ 



v 



k=0 



J 



Fix 5 > 0. Recalling that T e = oo, it suffices to show that 



-1 



k=0 



(2.9) 



as £ — > 0. 



For initial conditions z6M and for < /c < define 



fe.e 



1 ITT I 6 

: : \II kj£ \ > — 



Think of these sets as describing "bad ergodization." For example, roughly speak- 
ing, z e Bk, e if the orbit z £ (t) starting at z spends the time between tk, e and £fe+i j£ 



23 



in a region of phase space where the function H(-, 0) is "poorly ergodized" on the 
time scale L(e) by the flow z (t) (as measured by the parameter 5/2T). As IIk,e 
is clearly never larger than 0(L(e)), it follows that 

eLye) ^ 

z E \Ih,e\<^ + 0(eL(e)#(B z , E )). 

k=0 

Therefore it suffices to show that 

P (#(B ze ) > 6 —ttt) -> 

V V ' ; ~ const sL(e) J 

as e — > 0. By Chebyshev's Inequality, we need only show that 

^ i 

E(eL(e)#(B z , e )) = eL(e) £ 

tends to with e. 

In order to estimate the size of P(Bk, £ ), it is convenient to introduce a new 
measure P? that is uniformly equivalent to the restriction of Riemannian volume 
P to hr l V . Here the / stands for "factor," and P* is defined by 

(IP* = dh ■ dnh, 

where dh represents integration with respect to the uniform measure on V. 

Observe that £>o )£ = z e (tk, 6 )Bk, £ - In words, the initial conditions giving rise 
to orbits that are "bad" on the time interval [tk, £ ,tk+i,s], moved forward by time 
tk t£ , are precisely the initial conditions giving rise to orbits that are "bad" on the 
time interval [to jE , ti iS \. Because the flow z (-) preserves the measure we expect 
Pf(Bo, £ ) and Pf(Bk, £ ) to have roughly the same size. This is made precise by the 
following lemma. 

Lemma 2.3.2. There exists a constant K such that for each Borel set B C M. 
and each t E [-T/e,T/e], P f {z £ {t)B) < e KT P f (B). 

Proof. Assume that P f (B) > 0, and set 7(f) = ln(P' \z £ (t)B) / 'P* \B)) . Then 
7(0) = 0, and 

dn^ _ Ttk(t) B f( z ) dz = Lw div P iZ{z,e)dz 
dt L(t)B f(z)dz j zAt)B f(z)dz 

where / > is the C 1 density of P? with respect to Riemannian volume on 
h^V, dz represents integration with respect to that volume, and div P jZ(z, e) = 
div z f(z)Z(z,e). Because zo(t) preserves P* , divpfZ(z, 0) = 0. By Hadamard's 
Lemma, it follows that div P / Z(z, e) = 0{e) on the compact set h~ x V . Hence 
d^{t)/dt = 0(e), and the result follows. □ 
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Returning to our proof of Anosov's theorem, it suffices to show that 



P f (Bo tl 



dh ■ /i h \z : — -r 

v I M £ J 



L(e) 



H{zo(s),0)-H(ho(0))ds 



> 



2T 



tends to with s. By our ergodicity assumption, for almost every h, 



1 



Lis) 



L(e) 



H(z o (s),0)-H(h o (0))ds 



> — J. as ; — 0. 
~ 2T 



Finally, an application of the Bounded Convergence Theorem finishes the proof. 



□ 



2.4 Moral 



From the proofs of the theorems in this chapter, it should be apparent that there 
are at least two key steps necessary for proving a version of the averaging principle 
in the setting presented in Section 12.11 

The first step is estimating the continuity between the s = and the s > 
solutions of 

In particular, on some relatively long timescale L = L(s) <C s 1 , we need to show 
that 

sup \z (t) - z e {t)\ -> 

0<t<L 

as e — > 0. As long as L is sub-logarithmic in s~ l , such estimates for smooth 
systems can be made using Gronwall's Inequality. 

The second step is estimating the rate of ergodization of H(-, 0) by z (t), i.e. es- 
timating how fast 

j J H(z (s),0)ds^H(h ) 

(generally as L — > oo). Note that the estimates in this step compete with those 
in the first step in that, if L is small we obtain better continuity, but if L is large 
we usually obtain better ergodization. Also, we do not need the full force of the 
assumption of ergodicity of (zo(t),fi h ) on the fibers M. h . We only need z (t) to 
ergodize the specific function H(-, 0). Compare the proof of Theorem 12. 2. 51 

Note that in the setting of Anosov's theorem, uniform ergodization leads to 
uniform convergence in the averaging principle. Returning to the proof of Theo- 
rem 12.1.11 above, suppose that 

1 rL(e) 

— - / H(z (s),0)ds^H(h ) 
L \ £ ) Jo 
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uniformly over all initial conditions as L(e) — > oo. Then for all e sufficiently small 
and each k, £>fc j£ = 0, and hence for all e sufficiently small and each z, #(B Zt£ ) = 0. 
From Equation (12. 9p . it follows that sup 0<r<TATe \h(r) — h £ (r/e)\ — > as e — > 0, 
uniformly over all initial conditions z G h~ 1 V. However, uniform convergence in 
Birkhoff 's Ergodic Theorem is extremely rare and usually comes about because of 
unique ergodicity, so it is unreasonable to expect this sort of uniform convergence 
in most situations where Anosov's theorem applies. 
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Chapter 3 

Results for piston systems in one 
dimension 

In this chapter, we present our results for piston systems in one dimension. These 
results may also be found in |Wri06] . 

3.1 Statement of results 

3.1.1 The hard core piston problem 

Consider the system of n\ + n 2 + 1 point particles moving inside the unit interval 
indicated in Figure [3TTI One distinguished particle, the piston, has position Q and 
mass M. To the left of the piston there are n\ > particles with positions qij 
and masses mij, 1 < j < ni, and to the right there are n 2 > particles with 
positions q 2 j and masses m 2 j, 1 < j < These gas particles do not interact 
with each other, but they interact with the piston and with walls located at the 
end points of the unit interval via elastic collisions. We denote the velocities by 
dQ/dt = V and dxij/dt = Vij. There is a standard method for transforming 
this system into a billiard system consisting of a point particle moving inside an 
(ni + n 2 + l)-dimensional polytope ('.\I()(ia . but we will not use this in what 
follows. 

We are interested in the dynamics of this system when the numbers and masses 
of the gas particles are fixed, the total energy is bounded, and the mass of the 
piston tends to infinity. When M = oo, the piston remains at rest, and each 
gas particle performs periodic motion. More interesting are the motions of the 
system when M is very large but finite. Because the total energy of the system is 
bounded, MV 2 /2 < const, and so V = 0(M~ 1 ^ 2 ). Set 

e = M" 1 / 2 , 
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Figure 3.1: The piston system with ri\ — 3 and n 2 = 4. Note that the gas particles 
do not interact with each other, but only with the piston and the walls. 



and let 

w = v -, 

e 

so that 

dt 

with W = 0(1). 

When e — 0, the system has n 1 +n 2 + 2 independent first integrals (conserved 
quantities), which we take to be Q, W, and = \v it j\, the speeds of the gas 
particles. We refer to these variables as the slow variables because they should 
change slowly with time when e is small, and we denote them by 

h = (Q, W, Si,i, Si,2, • • • , Sl,m, S 2) l, S 2) 2, • • • , S 2 ,n 2 ) G R™ 1+ " 2 + 2 . 

We will often abbreviate by writing ft, = (Q,W, s± t j, s 2 j). Let h e (t,z) = h e (t) 
denote the dynamics of these variables in time for a fixed value of e, where z 
represents the dependence on the initial condition in phase space. We usually 
suppress the initial condition in our notation. Think of h £ { ) as a random variable 
which, given an initial condition in the 2(ni + n 2 + l)-dimensional phase space, 
produces a piecewise continuous path in W ll+m+2 . These paths are the projection 
of the actual motions in our phase space onto a lower dimensional space. The goal 
of averaging is to find a vector field on R^i+^+s w hose orbits approximate h e (t). 



28 



The averaged equation 

Sinai [Sin99j derived 



d_ 

dr 



~Q~ 




w 


= H{h) := 




, S2 >l 





w 

Q l-Q 



Q 

^ l-Q 



(3.1; 



as the averaged equation (with respect to the slow time r = et) for the slow 
variables. We provide a heuristic derivation in Section 13.21 Sinai solved this 
equation as follows: From 



d\n{s 



1,0 J 



d\n{Q) 



dr dr 

sid(f) = Sij {0)Q{0)/Q{t). Similarly, s 2j (r) = s 2j (0)(l-Q(0))/(l-Q(r)). Hence 

d 2 Q _ Y.jUmij8u{o) 2 Q{o) a e;:i^2,^ 2j (o) 2 (i - g(o)) 2 



dr 2 



and so (Q, W) behave as if they were the coordinates of a Hamiltonian system 
describing a particle undergoing periodic motion inside a potential well. If we let 



777,j j 2 



'J 



3=1 



be the kinetic energy of the gas particles on one side of the piston, the effective 
Hamiltonian may be expressed as 



1, 



-W + 



E 1 (0)Q(0y , E 2 (0)(1 - Q(0))'< 



Q 2 



+ 



(l-Q) 2 



(3.2) 



Hence, the solutions to the averaged equation are periodic for all initial conditions 
under consideration. 



Main result in the hard core setting 

The solutions of the averaged equation approximate the motions of the slow vari- 
ables, h £ (t), on a time scale 0(l/e) as e — > 0. Precisely, let h(r,z) = h{r) be the 
solution of 

^ = H(h), h(0) = h e (0)- 
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Again, think of h(-) as being a random variable that takes an initial condition in 
our phase space and produces a path in M. ni+n2+2 . 

Next, fix a compact set V C R"i+™2+ 2 such that h G V Q CC (0, 1), VF CC 
R, and Sj >3 - CC (0, oo) for each z and j'0 For the remainder of this discussion we 
will restrict our attention to the dynamics of the system while the slow variables 
remain in the set V. To this end, we define the stopping time 

T £ (z) = T £ := inf{r > : h(r) <£ V or h e (r/e) £ V}. 

Theorem 3.1.1. For each T > 0, 

sup sup \h e (r/e) - h(r)\ = 0(e) as e = M~ 1/2 -> 0. 

initial conditions 0<t<TAT e 
s.t. h s (0)eV 

This result was independently obtained by Gorelyshev and Neishtadt [GN06J. 

Note that the stopping time does not unduly restrict the result. Given any 
c such that h = c => Q E (0,1), s it j G (0, oo), then by an appropriate choice of 
the compact set V we may ensure that, for all e sufficiently small and all initial 
conditions in our phase space with h £ (0) = c, T £ > T. We do this by choosing 
V 3 c such that the distance between dV and the periodic orbit h(r) with h(0) = c 
is positive. Call this distance al. Then T £ can only occur before T if h e (r/e) has 
deviated by at least d from h(r) for some r G [0, T). Since the size of the deviations 
tends to zero uniformly with e, this is impossible for all small e. 

3.1.2 The soft core piston problem 

In this section, we consider the same system of one piston and gas particles inside 
the unit interval considered in Section 13.1.11 but now the interactions of the gas 
particles with the walls and with the piston are smooth. Let k: K — » R be a C 2 
function satisfying 

• k(x) — if x > 1, 

• k'(x) < if x < 1. 

Let S > be a parameter of smoothing, and set 

Ks(x) = k(x/5). 

1 We have introduced this notation for convenience. For example, h E V Q CC (0, 1) 
means that there exists a compact set A C (0, 1) such that h G V =>■ Q G A, and similarly for 
the other variables. 
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Then consider the Hamiltonian system obtained by having the gas particles inter- 
act with the piston and the walls via the potential 




^2 Ks(qij) + k s (Q - qij) + K s(Q2,j -Q) + K S (1- q 2 ,j)- 




As before, we set e = M 1//2 and W = V/e. If we let 

E ltj = -m 1}j vf d + K S (qij) + ks{Q ~ qij), 1 < j < rii, 
I 

E 2 ,j = ^ m 2,j v 2,j + K s(q2j - Q) + ks{1 - ?2j), 1 < j < n 2 , 



(3.3) 



then Eij may be thought of as the energy associated with a gas particle, and 
W 2 /2 + Y,%i Eij + E"=i E 2,j is the conserved energy 

When e = 0, the Hamiltonian system admits n\ + n 2 + 2 independent first 
integrals, which we choose this time as h = (Q,W, E lt j, E 2 j). While discussing 
the soft core dynamics we use the energies rather than the variables Sjj = 
^j2E it j/rriij, which we used for the hard core dynamics, for convenience. 

For comparison with the hard core results, we formally consider the dynamics 
described by setting 5 = to be the hard core dynamics described in Section l3.1.1l 
This is reasonable because we will only consider gas particle energies below the 
barrier height k(0). Then for any e, 5 > 0, h s £ (t) denotes the actual time evolution 
of the slow variables. While discussing the soft core dynamics we often use 5 as a 
superscript to specify the dynamics for a certain value of 5. We usually suppress 
the dependence on 5, unless it is needed for clarity. 

Main result in the soft core setting 

We have already seen that when S = 0, there is an appropriate averaged vector 
field H° whose solutions approximate the actual motions of the slow variables, 
h®(t). We will show that when 5 > 0, there is also an appropriate averaged vector 
field H s whose solutions still approximate the actual motions of the slow variables, 
h s £ (t). We delay the derivation of H 5 until Section T3.4.11 

Fix a compact set V C W ll+Tl2+2 such that h G V => Q CC (0, 1), W CC R, 
and CC (0, k(0)) for each i and j. For each e, 5 > we define the functions 
h s (-) and T e 5 on our phase space by letting h s (r) be the solution of 



dr 



H\h s ) 



h 5 (o) = h s M, 



(3.4) 



and 



r<5 



inf{r > : h\r) £ V or h s £ 



(r/e) i V}. 
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Theorem 3.1.2. There exists 5q > such that the averaged vector field H 5 (h) is 
C 1 on the domain {(5, h) : < 5 < 5o, h G V}. Furthermore, for each T > 0, 

sup sup sup \h s e {r/e) - h\r)\ = 0(e) as e = M~ 1/2 -> 0. 

0<5<(5o initial conditions 0<t<TAT s 

s.t. h e e (o)eV 

As in Section I3.1.1[ for any fixed c there exists a suitable choice of the compact 
set V such that for all sufficiently small e and 5, T £ > T whenever /if (0) = c. 

As we will see, for each fixed 5 > 0, Anosov's theorem 12.1.11 applies to the soft 
core system and yields a weak law of large numbers, and Theorem 12.2 . 51 applies and 
yields a strong law of large numbers with a uniform rate of convergence. However, 
neither of these theorems yields the uniformity over 5 in the result above. 



3.1.3 Applications and generalizations 

Relationship between the hard core and the soft core piston 

It is not a priori clear that we can compare the motions of the slow variables on 
the time scale 1/e for 5 > versus 5 = 0, i.e. compare the motions of the soft core 
piston with the motions of the hard core piston on a relatively long time scale. It 
is impossible to compare the motions of the fast-moving gas particles on this time 
scale as e — > 0. As we see in Section 13.41 the frequency with which a gas particle 
hits the piston changes by an amount 0{8) when we smooth the interaction. Thus, 
on the time scale 1/e, the number of collisions is altered by roughly 0(5/ e), and 
this number diverges if 5 is held fixed while e — » 0. 

Similarly, one might expect that it is impossible to compare the motions of the 
soft and hard core pistons as e — > without letting 5 — > with e. However, from 
Gronwall's Inequality it follows that if h s (0) = h°(0), then 

sup \h s (r) -h°(r)\ = 0{5). 

0<r<TAT*AT £ ° 

From the triangle inequality and Theorems 13.1.11 and 13 . 1 . 21 we obtain the following 
corollary, which allows us to compare the motions of the hard core and the soft 
core piston. 

Corollary 3.1.3. As e = M~ 1//2 , 5^0, 

sup sup sup \h s £ (t)-h £ (t)\ = O(e) + O(5). 

ceV initial conditions 0<t<(TAT s AT°)/e 
s.t. hi(0)=c=h°(0) ~ E 

This shows that, provided the slow variables have the same initial conditions, 



sup \hl(t) - h° e (t)\ = 0(e) + 0(5). 

0<t<l/e 



Thus the motions of the slow variables converge on the time scale 1/e as e, 5 — ► 
and it is immaterial in which order we let these parameters tend to zero. 
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The adiabatic piston problem 

We comment on what Theorem 13.1.11 says about the adiabatic piston problem. The 
initial conditions of the adiabatic piston problem require that W(0) =0. Although 
our system is so simple that a proper thermodynamical pressure is not defined, 
we can define the pressure of a gas to be the average force received from the gas 
particles by the piston when it is held fixed, i.e. Pi = Y^jLi ^ m i,j s hj^Q = 2Pi/Q 
and P 2 = 2E 2 /(1 - Q). Then if Pi(0) > P 2 (0), the initial condition for our 
averaged equation (13. ip has the motion of the piston starting at the left turning 
point of a periodic orbit determined by the effective potential well. Up to errors not 
much bigger than M~ 1//2 , we see the piston oscillate periodically on the time scale 
M 1/2 . If Pi(0) < P 2 (0), the motion of the piston starts at a right turning point. 
However, if Pi(0) = P2(0), then the motion of the piston starts at the bottom of 
the effective potential well. In this case of mechanical equilibrium, h(r) = h(0), 
and we conclude that, up to errors not much bigger than M -1 / 2 , we see no motion 
of the piston on the time scale M 1 / 2 . A much longer time scale is required to see 
if the temperatures equilibrate. 

Generalizations 

A simple generalization of Theorem 13.1.11 proved by similar techniques, follows. 
The system consists of iV — 1 pistons, that is, heavy point particles, located inside 
the unit interval at positions Qi < Q 2 < ■ ■ ■ < Qn-i- Walls are located at Qq = 
and Qn = 1, and the piston at position Qi has mass M{. Then the pistons divide 
the unit interval into N chambers. Inside the i th chamber, there are n.j > 1 gas 
particles whose locations and masses will be denoted by x i; j and rriij, respectively, 
where 1 < j < rij. All of the particles are point particles, and the gas particles 
interact with the pistons and with the walls via elastic collisions. However, the 
gas particles do not directly interact with each other. We scale the piston masses 
as Mi = Mi/e 2 with Mj constant, define Wi by dQi/dt = eW{, and let Pj be 
the kinetic energy of the gas particles in the i chamber. Then we can find an 
appropriate averaged equation whose solutions have the pistons moving like an 
(N — l)-dimensional particle inside a potential well with an effective Hamiltonian 



If we write the slow variables as h = (Qi, Wi, \vij\) and fix a compact set V such 
that h G V => Q i+ i - Qi CC (0,1), Wi CC E, and \v itj \ CC (0, oo), then the 
convergence of the actual motions of the slow variables to the averaged solutions 
is exactly the same as the convergence given in Theorem 13.1.11 



N-l N 



£i(0)(Qi(0)-Qi-i(0)) 2 
(Qi - Q^i) 2 



i=l 1=1 
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Remark 3.1.1. The inverse quadratic potential between adjacent pistons in the 
effective Hamiltonian above is also referred to as the Calogero-Moser-Sutherland 
potential. It has also been observed as the effective potential created between two 
adjacent tagged particles in a one-dimensional Rayleigh gas by the insertion of one 
very light particle inbetween the tagged particles [BTT07J. 



3.2 Heuristic derivation of the averaged equa- 
tion for the hard core piston 



We present here a heuristic derivation of Sinai's averaged equation (13. ip that is 
found in [Dol05] . 

First, we examine interparticle collisions when e > 0. When a particle on the 
left, say the one at position qij, collides with the piston, sij and W instantaneously 
change according to the laws of elastic collisions: 



(3.5) 



If the speed of the left gas particle is bounded away from zero, and W = M X ' 2 V 
is also bounded, it follows that for all e sufficiently small, any collision will have 
v^j > and vfj < 0. In this case, when we translate Equation (13.51) into our new 
coordinates, we find that 





1 


'mtj - M 2M 






v+_ 


m ld + M 


2mij M — rrixj 




y- 



°1J 
w + 



1 + e 2 mi 



1 — e 2 mij 



-2e 
1 — e 2 m\ 









w- 



(3.6) 



so that 



As ld 
AW 



W + 



w- 



-2eW~ + 0(e 2 ), 
+2em ltj s^ + 0(s 2 



The situation is analogous when particles on the right collide with the piston. 
For all e sufficiently small, S2j and W instantaneously change by 



AW 

As 2J 



W + - W~ 



S 2j S 2,j 



-2em 



-2eW~ + 0(e 2 



We defer discussing the rare events in which multiple gas particles collide with the 
piston simultaneously, although we will see that they can be handled appropriately. 

Let At be a length of time long enough such that the piston experiences many 
collisions with the gas particles, but short enough such that the slow variables 
change very little, in this time interval. From each collision with the particle at 
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position (ji j , W changes by an amount +2em lj sij + 0(e 2 ), and the frequency 
of these collisions is approximately Arguing similarly for collisions with the 
other particles, we guess that 

AW ni s " 2 s 

at = £ E 2 ^ s ^2g ~ e £ 2m2 ^2(r=% + 0( ' 2) - 

j=i i=i 

Note that not only does the position of the piston change slowly in time, but its 
velocity also changes slowly, i.e. the piston has inertia. With r = et as the slow 
time, a reasonable guess for the averaged equation for W is 

dr Q 1 - Q 

Similar arguments for the other slow variables lead to the averaged equation (13. ip . 

3.3 Proof of the main result for the hard core 
piston 

3.3.1 Proof of Theorem [37T7T] with only one gas particle on 
each side 

We specialize to the case when there is only one gas particle on either side of 
the piston, i.e. we assume that rii = n 2 = 1. We then denote by qi, m 2> 2 
by m 2 , etc. This allows the proof's major ideas to be clearly expressed, without 
substantially limiting their applicability. At the end of this section, we outline the 
simple generalizations needed to make the proof apply in the general case. 

A choice of coordinates on the phase space for a three particle system 

As part of our proof, we choose a set of coordinates on our six-dimensional phase 
space such that, in these coordinates, the e = dynamics are smooth. Complete 
the slow variables h = (Q, W, si, s 2 ) to a full set of coordinates by adding the 
coordinates (ft G [0, 1]/ ~ 1 = S 1 , i — 1, 2, defined as follows: 

<Pi = <Pi(qi,vi,Q) 



f2 = V2(q2,V 2 ,Q) 



if v x > 




2(1-0) 



if v 2 < 
if v 2 > 
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When e = 0, these coordinates are simply the angle variable portion of action- 
angle coordinates for an integrable Hamiltonian system. They are defined such 
that collisions occur between the piston and the gas particles precisely when ipi 
or tp 2 = 1/2. Then z = (h, (pi, ip 2 ) represents a choice of coordinates on our phase 
space, which is homeomorphic to (a subset of 1R 4 ) x T 2 . We abuse notation and 
also let h(z) represent the projection onto the first four coordinates of z. 

Now we describe the dynamics of our system in these coordinates. When 
<p!,<P2 ^ 1/2, 



dc Pi _ J ih 
dt 

d(p 2 
~dT 




if < ^1 < 1/2 
if 1/2 <ip x <l 

if < y? 2 < 1/2' 
<p 2 ) if 1/2 < y? 2 < 1 



2(1-Q) 

Hence between interparticle collisions, the dynamics are smooth and are described 
by 



dQ 

dt 
dW 

~dT 

ds\ 

~dl 

ds 2 
~dt 
d<pi 
dt 
d(f2 
dt 



0. 



(3.7) 



S2 



+ 0(e) 



2(1 -Q) 

When ip 1 reaches 1/2, while (p 2 7^ 1/2, the coordinates Q,s 2 ,<fx, and cp 2 are 
instantaneously unchanged, while si and W instantaneously jump, as described by 
Equation (13. 6p . As an aside, it is curious that s^+eW + = s^— eW~ , so that dipi/dt 
is continuous as <p\ crosses 1/2. However, the collision induces discontinuous jumps 
of size 0(e 2 ) in dQ/dt and d^/dt. Denote the linear transformation in Equation 
fl3~6l) with j = 1 by A 1>£ . Then 



1 

2emi 



-2e 
1 



The situation is analogous when (p 2 reaches 1/2, while ipi 7^ 1/2. Then W and 
s 2 are instantaneously transformed by a linear transformation 



.4 



2.6 



1 

2e 



-2em2 
1 
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We also account for the possibility of all three particles colliding simultane- 
ously. There is no completely satisfactory way to do this, as the dynamics have an 
essential singularity near {ipi = (f 2 = 1/2}. Furthermore, such three particle colli- 
sions occur with probability zero with respect to the invariant measure discussed 
below. However, the two 3x3 matrices 



~A l>£ 




"1 " 


1 




A 2i£ _ 



have a commutator of size 0(e 2 ). We will see that this small of an error will make 
no difference to us as e — > 0, and so when tp x = y?2 = 1/2, we pretend that the 
left particle collides with the piston instantaneously before the right particle does. 
Precisely, we transform the variables s±, W, and s 2 by 



We find that 



Asi 
AW 
As 2 



W + 



'i 



1 " 




~A li£ 0" 


A 2 , £ _ 




1 



'1 



w + - w~ 

S 2 ~ S 2 = 



w~ 

$2 



-2eW~ + C(e 2 ), 

= +2emis]" - 2em 2 s 2 + 0(e 2 ) 

+2eW- + 0(e 2 ). 



The above rules define a flow on the phase space, which we denote by z e (t). 
We denote its components by Q E (t), W e (t), Si, e (t), etc. When e > 0, the flow is 
not continuous, and for definiteness we take z £ (t) to be left continuous in t. 

Because our system comes from a Hamiltonian system, it preserves Liouville 
measure. In our coordinates, this measure has a density proportional to Q(l — 
Q). That this measure is preserved also follows from the fact that the ordinary 
differential equation (13. 7p preserves this measure, and the matrices A Xj£ , A 2>£ have 
determinant 1. Also note that the set = ip 2 = 1/2} has co-dimension two, 
and so [J t z £ (t){<pi = (p 2 = 1/2} has co-dimension one, which shows that only a 
measure zero set of initial conditions will give rise to three particle collisions. 



Argument for uniform convergence 

Step 1: Reduction using Gronwall's Inequality. Define H(z) by 



H(z) 



W 

2m 1 s l 8 ipi=1 / 2 - 2m 2 s 2 <^2=i/2 
-2W8 91=1/2 
2W5 ip2=1/2 
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Here we make use of Dirac delta functions. All integrals involving these delta 
functions may be replaced by sums. We explicitly deal with any ambiguities arising 
from collisions occurring at the limits of integration. 

Lemma 3.3.1. For < t < I ^ k , 

F ' 



h £ (t)-h £ (0)=e [ H(z £ (s))ds + 0(e), 
Jo 



where any ambiguity about changes due to collisions occurring precisely at times 
and t is absorbed in the 0(e) term. 

Proof. There are four components to verify. The first component requires that 
Qe(t)-Q £ (0) =e f*W £ (s)ds + 0(e). This is trivially true because Q £ {t) -Q e (0) = 
e /„* W £ (s)ds. 

The second component states that 

W £ {t)-W e {0) = e [ 2m 1 s 1 ^(s)6^ Aa)=1/2 -2m 2 s 2 ^s)6^ Aa)=1/2 ds+0(e). (3.8) 
Jo 

Let r k and qj be the times in (0, £) such that (pi jE (rk) = 1/2 and (p 2j£ (qj) — 1/2) 
respectively. Then 

W £ (t) - W £ (0) = &W £ (r k ) + &W E { qj ) + 0(e). 

Observe that there exists u > such that for all sufficiently small e and all h E V, 
1/oj < ^ < oj. Thus the number of collisions in a time interval grows no faster 
than linearly in the length of that time interval. Because t <T/e, it follows that 

W £ (t) -W £ (0)=eJ2 2m ^iAn) -eY, 2m 2 s 2 , £ (qj) + 0(e), 

and Equation (13.81) is verified. Note that because V is compact, there is uniformity 
over all initial conditions in the size of the 0(e) terms above. The third and fourth 
components are handled similarly. □ 

Next, h(r) satisfies the integral equation 

h( T ) - h(0) = [ H(h(a))da, 
Jo 

while h e (r/e) satisfies 

h £ (r/e) - h £ (0) = 0(e) + e [ ' H(z £ (s))ds 

Jo 



»r/e 

0(e)+e I H(z £ (s))-H(h £ (s))ds+ I H(h £ (a/e))da 
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for < r < T A T £ . 
Define 



r/e 

< ■(-)■ :: / H(z e (s)) - H(h £ (s))ds. 
'o 



It follows from Gronwall's Inequality that 



sup \h(r) — h e {r/e)\ < O(e) + sup |e e (r)| ) e 

0<t<T/\T s ' V 0<r<T/\T £ 



Lip(5| v )r 



(3.9) 



Gronwall's Inequality is usually stated for continuous paths, but the standard 
proof (found in [SV85J) still works for paths that are merely integrable, and 
\h(r) — h e {r/e)\ is piecewise smooth. 



Step 2: A splitting according to particles. Now 



H(z) - H(h) 





2mis 1 5 tpi= i/ 2 - m x s\IQ 
-2W5 Vl=1/2 + s x W/Q 




+ 







-27^252^=1/2 + m 2 sl/ (1 - Q) 


2W8 (p2=1/2 - s 2 W/(l - Q) 



and so, in order to show that sup 0<r<TAT£ |e e (r)| = 0(e), it suffices to show that 

-r/e 
'0 



sup 

0<t<TAT £ 



sup 

0<t<TAT £ 



W e (s)S VlAs)=1/2 



2Q £ (S 

W £ (s)s lt£ (s 
2Q e (s) 



■ds 



o(i), 



as well as two analogous claims about terms involving (p 2)£ . Thus we have effec- 
tively separated the effects of the different gas particles, so that we can deal with 
each particle separately. We will only show that 



sup 

0<r<TAT e 



r/e , , / . \ 1 

M*)«W(.)=i/a 2Qe(s 



S ^ S < ds 



0(1). 



The other three terms can be handled similarly. 



Step 3: A sequence of times adapted for ergodization. Ergodization refers 
to the convergence along an orbit of a function's time average to its space average. 
For example, because of the splitting according to particles above, one can easily 
check that \ J* H(z (s))ds = H(h ) + 0(1/*), even when z (-) restricted to the 
invariant tori M.h is not ergodic. In this step, for each initial condition z £ (0) in 
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our phase space, we define a sequence of times t ky£ inductively as follows: t ,e — 
inf{t > : ipi, B (t) = 0}, t k+1 ^ £ = inf{t > t k>e : </?i, e (t) — 0}. This sequence is 



chosen because 5, 



2 is "ergodizd" as time passes from t k to t k+ i o- If £ is 



¥>i,o(s)=V 

sufficiently small and t k +i,E < (T A T e )/e, then the spacings between these times 
are uniformly of order 1, i.e. 1/ui < t k +i, E — tfc, e < u). Thus, 



sup 

0<t<T/\T e 



<0(1) + £ 



*fe+l,e<- 



* fc+1 ' £ c u gi, £ (g) 2 ^ 



(3.10) 



Step 4: Control of individual terms by comparison with solutions along 
fibers. The sum in Equation (I3.10p has no more than 0(1/ e) terms, and so it 
suffices to show that each term is no larger than 0(e). We can accomplish this by 
comparing the motions of z £ (t) for t).^ < t < t k+ i t£ with the solution of the 6 = 
version of Equation (13.71) that, at time t kjS , is located at z e (tk, e )- Since each term 
in the sum has the same form, without loss of generality we will only examine the 
first term and suppose that t 0j£ = 0, i.e. that <^i, e (0) = 0. 

Lemma 3.3.2. Ift lj£ < then sup < 4 < 4le \z (t) - z £ (t)\ = 0(e). 

Proof. To check that sup 0<t<tle \ho(t) — h £ (t)\ = 0(e), first note that ho(t) = 
h Q (Q) = h £ (0). Then dQ £ /dt = 6(e), so that Q (t)-Q £ (t) = 0(et). Furthermore, 
the other slow variables change by 0(e) at collisions, while the number of collisions 
in the time interval [0, tx }£ ] is 0(1). 

It remains to show that sup 0<4<4i \<p i; o(t) — <pi >e (t)\ = 0(e). Using what we 
know about the divergence of the slow variables, 

<p lfi (t) - Vh£ (t) = f %M - + 0(e)ds = f 0(e)ds = O(e) 

J Q 2Q (s) 2Q £ (s) J 

for < t < ti )£ . Showing that sup 0<t<<1 e \(pz,o(t) — <f2,e(t)\ = 0(e) is similar. 

□ 

From Lemma |3.3.2[ t 1)£ = t 1)0 + 0(e) = 2Q /si i0 + 0(e). We conclude that 

= O(e) + s 1 , ~t 1 ^ = O(e). 

It follows that sup 0<r<TATe \h £ (r/e) — h(r)\ = O(e), independent of the initial 
condition in h~ l V. 
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3.3.2 Extension to multiple gas particles 



When ni,ri2 > 1, only minor modifications are necessary to generalize the proof 
above. We start by extending the slow variables h to a full set of coordinates on 
phase space by defining the angle variables (p^ € [0, 1]/ ~ 1 = S 1 for 1 < i < 2, 

1 < j < TLi. 



V>2j = ¥>2,j(q2,j,V 2 ,j,Q) 



l 



Then d(p ld /dt = s 1J (2Q)- 1 + 0(e), d<p 2 ,j/dt = s 2j (2(l - Q))" 1 + 0(e), and 
z = (h, (fi 7 j , (f2,j) represents a choice of coordinates on our phase space, which 
is homeomorphic to (a subset of lR ni+rt2+2 ) x T ni+712 . In these coordinates, the 
dynamical system yields a discontinuous flow z e (t) on phase space. The flow 
preserves Liouville measure, which in our coordinates has a density proportional 
to Q ni (l — Q) n2 . As is Section 13.3.11 one can show that the measure of initial 
conditions leading to multiple particle collisions is zero. 
Next, define H(z) by 



H(z) 



W 



-2W5, 



2W5, 



V2j=l/2 



For < t < h £ {t) - h £ (0) = e f*H(z e (s))ds + 0(e). From here, the rest of 

the proof follows the same arguments made in Section 13.3.11 



3.4 Proof of the main result for the soft core 
piston 

For the remainder of this chapter, we consider the family of Hamiltonian systems 
introduced in Section I3.1.2[ which are parameterized by e, 5 > 0. For simplicity, 
we specialize to n\ — n 2 — 1. As in Section [3731 the generalization to ni,n 2 > 1 
is not difficult. The Hamiltonian dynamics are given by the following ordinary 
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differential equation: 



dQ 

dt 
dW 

~dT 

dx\ 

~dt 

dv\ 1 
dt mi 

d%2 

~dt 

dv 2 1 



eW, 

e (-k' s (Q - x x ) + k' s (x 2 - Q)) , 
vi, 

(-k's(xi) +k' s (Q-xx)), 

I 

''2- 

(-k' s (x 2 -Q)+k' s (1-x 2 )). 



(3.11) 



dt m 2 

Recalling the particle energies defined by Equation (13. 3p . we find that 

^ = bWk' 5 (Q - Xl ), = -bWk' s (x 2 - Q). 

For the compact set V introduced in Section 13.1.21 fix a small positive number 
S and an open set WcR 4 such that V C U and h e U => Q G (£, 1-6), W CC R, 
and 6 < Ei, E 2 < k(0) — 6. We only consider the dynamics for < 5 < 6/2 and 
heU. 

Define 

Ui(qi) = Ui(qi, Q, S) := K$(gi) + k s (Q - qi), 
U 2 (q 2 ) = U 2 (q 2 , Q, 5) := n 5 (q 2 - Q) + k s (1 - q 2 ). 

Then the energies Ei satisfy Ei = m^vf /2 + Ui(xi). 

Let Ti = T\{Q, Ei, 5) and T 2 = T 2 (Q, E 2 , 5) denote the periods of the motions 
of the left and right gas particles, respectively, when e — 0. 

Lemma 3.4.1. For % = 1,2, 

Ti e C l {(Q, E h 6) :Q E (£,!-£), EiE {£, k{0) -£),0<6< 6/2}. 
Furthermore, 



T 1 (Q,Ei,5) = J^Q + 0(S), 



'1m 

T 2 (Q.E, t S) = % l-p-(l-Q)+0(S). 
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The proof of this lemma is mostly computational, and so we delay it until 
Section 13.51 Note especially that the periods can be suitably defined such that 
their regularity extends to 5 = 0. 

In this section, and in Section 13.51 below, we adopt the following convention 
on the use of the O notation. All use of the O notation will explicitly contain the 
dependence on e and 5 as e, 6 — > 0. For example, if a function f(h,e,S) = 0(e), 
then there exists 5',e' > such that sup 0<e<£ , <5<<5' Aev l/(^> e ^ ^)/ e \ < °°- 

When e — 0, 

Define a = a(Ei, S) by 

Ks(a) = k{cl/8) = Ei, 

so that a(Ei,5) is a turning point for the left gas particle. Then a = 5K~ l (Ei), 
where k~ x is defined as follows: k : [0, 1] — > [0, k(0)] takes to k(0) and 1 to 0. 
Furthermore, k G C 2 ([0,1]), k' < 0, and k'(x) < if x < 1. By monotonicity, 
: [0, k(0)) — > [0, 1] exists and takes to 1 and k(0) to 0. Also, by the Implicit 
Function Theorem, /sT 1 G C 2 ((0, k(0)]), < for y > 0, and -> 

— oo as y — > + . Because we only consider energies E{ G (£, k(0) — £), it follows 
that a(Ei, 5) is a C 2 function for the domains of interest. 



3.4.1 Derivation of the averaged equation 



As we previously pointed out, for each fixed 5 > 0, Anosov's theorem 12.1.11 and 
Theorem 12.2.51 apply directly to the family of ordinary differential equations in 
Equation (13.111) . provided that 5 is sufficiently small. The invariant fibers M. h of 
the e = flow are tori described by a fixed value of the four slow variables and 
{(Q, W, q u v x , q 2 , v 2 ) : E 1 = + U x {q u Q, 5),E 2 = m 2 v 2 2 /2 + U 2 (q 2 , Q, 5)}. 

we use (gi, q 2 ) as local coordinates on Aih, which is valid except when v\ or v 2 = 
the invariant measure fih of the unperturbed flow has the density 

dq x dq 2 



If 
0. 



T 1 J±(E 1 - Ufa)) T 2 J^(E 2 - U 2 (q 2 )) 



The restricted flow is ergodic for almost every h. 

E3J 

Now 

W 



See Corollary 13.5.11 in Section 



dh & £ 
~dt 



-k' s (Q ~ qi) + K' s {q 2 - Q) 
Wn' 5 (Q- qi ) 
-Wn' 5 {q 2 -Q) 
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and 



<<n \a 2 f Q ~° 'a k 's(Q ~ gi) 



v / 2mi 
Ti 



-5 \JE\- Ks(Q - qi] 
f El du 
T\ J sJE x -u 



Similarly, 



y/8m 2 E 2 



It follows that the averaged vector field is 



H\h) 



W ' 

V8mi.Ei V8m,2-E2 

Ti T 2 



where from Lemma EXH we see that £T(-) G C 1 ({(5,/i) : < 5 < £/2,/i G V}). 
H°(h) agrees with the averaged vector field for the hard core system from Equation 
(13. ip . once we account for the change of coordinates E^ = misj/2. 

Remark 3.4.1. An argument due to Neishtadt and Sinai [NS04] shows that the 
solutions to the averaged equation (13. 4p are periodic. This argument also shows 
that, as in the case 5 = 0, the limiting dynamics of (Q, W) are effectively Hamil- 
tonian, with the shape of the Hamiltonian depending on 5, Q(0), and the initial 
energies of the gas particles. The argument depends heavily on the observation 
that the phase integrals 



Ii(Q,Ei,S) 



dxdv 



\miV 2 +Ui(x,Q ,5)<Ei 



are adiabatic invariants, i.e. they are integrals of the solutions to the averaged 
equation. Thus the four-dimensional phase space of the averaged equation is foli- 
ated by invariant two-dimensional submanifolds, and one can think of the effective 
Hamiltonians for the piston as living on these submanifolds. 
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3.4.2 Proof of Theorem 13X21 



The following arguments are motivated by our proof in Section 13.31 although the 
details are more involved as we show that the rate of convergence is independent 
of all small S. 



A choice of coordinates on phase space 

We wish to describe the dynamics in a coordinate system inspired by the one used 
in Section [3.3.11 For each fixed S G (0, Jo], this change of coordinates will be C 1 
in all variables on the domain of interest. However, it is an exercise in analysis to 
show this, and so we delay the proofs of the following two lemmas until Section 

We introduce the angular coordinates ipi G [0, 1]/ ~ 1 = S 1 defined by 



<Pi = <Pi(Qi,Vi,Q) 



¥2 = ¥2(q2,V 2 ,Q) 





j_ rqi 

Tx Ja 

1/2 

1-f In 

1\ Ja 



t 2 hi 
1/2 



mi/2 



-Ui(s) 



<-li 



mi/2 
Ei-Ui(s) 



m 2 /2 
E 2 -U 2 (s) 



1-a 
T 2 Jq 2 



if qi = a 
ds if v± > 

if q± = Q — a 
ds if vi < 

if qi = 1 — a 
ds if v 2 < 

ii q 2 = Q + a 
ds if v 2 > 



(3.12) 



m 2 /2 
E 2 -U 2 (s) 



Then z = (h, ipi, (p%) is a choice of coordinates on h~ l U. As before, we will abuse 
notation and let h(z) denote the projection onto the first four coordinates of z. 

There is a fixed value of So in the statement of Theorem 13.1.21 However, for 
the purposes of our proof, it will be convenient to progressively choose So smaller 
when needed. At the end of the proof, we will have only shrunk So a finite number 
of times, and this final value will satisfies the requirements of the theorem. Our 
first requirement on So is that it is smaller than 8/2. 

Lemma 3.4.2. If So > is sufficiently small, then for each S G (0, So] the ordinary 
differential equation (13. lip in the coordinates z takes the form 



Tt= Z ^l 



(3.13) 



where Z 5 G C l (h~ l U x [0, oo)). When z G h^U, 
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Z s (z,e) 



eW 



s(-k' s (Q - q^z)) + K' s (q 2 (z) 
eWK' s (Q- qi (z)) 
-eWK' 5 {q 2 {z)-Q) 
i + 0(e) 
+ 0(e) 



- Q)) 



T 2 



(3.14) 



Recall that, by our conventions, the O(e) terms in Equation (13.141) have a 
size that can be bounded independent of all S sufficiently small. Denote the flow 
determined by Z s (-,e) by z £ (t), and its components by Qf(i), W £ (t), E( £ (t), etc. 
Also, set h 5 £ (t) = h(z 5 £ {t)). From Equation (13341) . 



H\z,e) 



ldh s E 
e dt 



W 

-k' s (Q - qi(z)) + 4(52(2) 

WK' s (Q- qi (z)) 
-WK> s (q 2 (z)-Q) 



-Q) 



(3.15) 



In particular, H s (z,e) = H 5 (z,0). 

Before proceeding, we need one final technical lemma. 

Lemma 3.4.3. If 5q > is chosen sufficiently small, there exists a constant K 
such that for all S G {0,5 }, n' s (\Q - Xi(z)\) = unless (p { G [1/2 - KS, 1/2 + KS]. 



Argument for uniform convergence 

We start by proving the following lemma, which essentially says that an orbit 
zf(t) only spends a fraction 0{8) of its time in a region of phase space where 
\H s (z 5 £ (t),e) \ = \H s (z 5 £ (t),0)\ is of size C^ 1 ) 



Lemma 3.4.4. For < T < T < 



TAT/ 



/ \H\z s £ (s),0)\ds = O(l V(T-T' 



))• 



Proof. Without loss of generality, T = 0. From Lemmas 13.4. II and 13.4.21 it follows 
that if we choose Sq sufficiently small, then there exists u > such that for all 

sufficiently small e and all 5 G (0, 5 ], h G V =>- 1/oj < ^ff- < u. Define the set 
B = [1/2 — K5, 1/2 + KS], where K comes from Lemma 13.4.31 Then we find a 
crude bound on \k' s (Q £ (s) — qi(z £ (s))) | ds using that 



dt 



is 



> 1/u if <f{ >£ G B 
<u if v { s eB c . 
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This yields 

r-T 



\n' 5 {Qi{s)- qi {4{s)))\ds< 



const 



r 



-'0 



< 



const 



2Ku5 



5 \ 2Ku5 + 1=2& 
0(1 VT). 



T + 2Ku5 



Similarly, f Q d \K' s (q 2 (z 5 £ (s)) - Q s e (s))\ds = 0(1 VT), and so J Q (s), 0) | ds = 

O(IVT). □ 



We now follow steps one through four from Section l3.3.H making modifications 
where necessary. 

Step 1: Reduction using Gronwall's Inequality. Now h £ (r/e) satisfies 

r/e 



Define 



h*(T/e)-h s E (0)=e [ T £ H s (z s £ (s) 1 0)ds. 
Jo 

e s £ (t) = e ^ 0) - H s (h 5 £ (s))ds. 

Jo 



It follows from Gronwall's Inequality and the fact that H'(-) G C 1 ({(5, /i) : < 
5 < So, h e V}) that 



sup \h 5 £ (r/e) - h s (r)\ < ( sup |ef(r)| ) e Lip (^ lv ) T 



C ( sup |ef(r 

. 0<t<TAT« 



(3.16) 



Step 2: A splitting according to particles. Next, 
H\z,0) -H 5 (h) 







-<(Q- gi (,))-^fH 
WV 5 (Q- gi (z)) + W^|^ 




+ 







«)-Q) + 3 f s 
o 



-W<(? 2 (z) - Q) - 



T 2 
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and so, in order to show that sup 0<T<TAT 5 lef (t) I = 0(e), it suffices to show that 



for i = 1, 2, 



sup 

0<r<TAT/ 



sup 

0<t<TAT/ 



t/s 



^(|Q £ 5 ( s )-x,(4(,))|) + 



ds 



o(i), 



r/e 



= 0(1). 
We only demonstrate that 



sup 

0<r<TAT* 



T/6 



*i(ofoo-9i(*J(*))) 



8rai-E7f e (s) 



'/- 



(9(1). 



The other three terms are handled similarly. 



Step 3: A sequence of times adapted for ergodization. Define the se- 
quence of times t s k£ inductively by t & 0£ = inf{t > : (f{^(t) = 0}, tf +l e = inf{t > 



L k,e 



,e\"J "J ' b k+l,e 



(t) = 0}. If e and 5 are sufficiently small and t k+le < (T A T e )/e, then 
it follows from Lemma 13.4.21 and the discussion in the proof of Lemma 13.4.41 that 



1/oj < t 



5 

k+l,e 



tie < W- 



From Lemmas 13.4.21 and 13.4.41 it follows that 



sup 

0<r<TAT« 



<0(1) 



r /e J8miEf E (s) 



C fc+l,e- 



<(Q £ ( S ) - qi (4(s))) + 



8mi^f e (s) 



(3.17) 



Step 4: Control of individual terms by comparison with solutions along 
fibers. As before, it suffices to show that each term in the sum in Equation 
( I3.17P is no larger than 0(e). Without loss of generality we will only examine the 
first term and suppose that t & Qe = 0, i.e. that ip{ e (0) = 0. 

Lemma 3.4.5. Ift( . < then sup 0<t<t s \z&(t) - z 5 Jt)\ = 0(e). 

) £ 1,£ I 
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Proof. By Lemma \MM h s (t) - h 5 £ {t) = /if (0) - h s e (t) = -e f* H s (z s £ (s),0)ds = 
00(1 Vt)) for t > 0. 

Using what we know about the divergence of the slow variables, we find that 



= C(e) 

for < t < t[ e . Lemmas EXH and [3321 ensure the desired uniformity in the sizes 
of the orders of magnitudes. Showing that sup 0<t<t « |y>2,o(A) — ^2,e(*)| = C( £ ) 1S 
similar. 

□ 

From Lemma[3X5]we find that t l e = t lfi + 0(e) = T\(Qg, E%, 5) + 0(s). Hence 



o T^M^ElM^) W Jo T^QiEf^S) 



8 mi E s l£ {s) r t{ J% mi El 



.0 



ds = O(e) + / _v_ r _d s 



0{e) + JS mi Etn. 



But when < Q e — a 



rf /^77 7V^\ sign(^(4( s )))^(gf( s )- gi (4( s ))) 



and so 

I " ^(Qfo) - fc(af00))<fa = -^hEjjO) - ^2m x ElM,e) 

= 0{e) - y/srmElo. 

Hence, 



as desired. 
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3.5 Appendix to Section 13.4 

Proof of Lemma 13.4. It 

Proof. For < 5 < S/2, 



Q—a 



mi/2 



T 1 = T 1 (Q,E l ,8) = 2 I \l Ei _ u l(s) ds > 



T 2 = T 2 (Q,E 2 ,6) = 2 [ \ m2 / 2 ds. 
We only consider the claims about 7\, and for convenience we take m x = 2. Then 

tzs r Q/2 ds 



n~ a ds r 



Et-Uxis) J a y/E X -K S {s) 
Q/2-5 + f s ds 



El J a ^E 1 -K 5 {s) 

2Q-A5 f 1 ds 
+ 45 1 



E\ Jk-\e x ) \JE x - k{s) 



i V E - k{s) Jo y/E-u 



Define 



Notice that (k -1 )'^) diverges as u —* + , while (E — u)~ l l 2 diverges as u — > i? - , 
but both functions are still integrable on [0, E]. It follows that F(E) is well defined. 
Then it suffices to show that F : [£, k(Q) — £] — > R is C 1 . 
Write 

Jo y/E-u Js/2 y/E-u 

:=F 1 {E) + F 2 {E). 

A standard application of the Dominated Convergence Theorem allows us to dif- 
ferentiate inside the integral and conclude that F x G C°°([£, k(0) — £ ]), with 

To examine F 2 , we make the substitution v = E — u to find that 



F2(E) ;^d^m^A dv . 
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Using the fact that (« x )' G C 1 ([S/2, k(0)]) and the Dominated Convergence The- 
orem, we find that F 2 is differentiable, with 

F , (E) = + r- £ ' 2 -A^m^A dv . 



E - £/2 Jo Vv 

Another application of the Dominated Convergence Theorem shows that F 2 is 
continuous, and so F 2 G k(0) — £]). 

Thus 

Ti(Q, E l ,S) = -^=+A5 \-E- 1/2 + F 1 (E 1 ) + F 2 {E, 



l Ex 

has the desired regularity. For future reference, we note that 

^=-7=^ —^ = -^ + 0(5). (3.18) 
dQ sfE x dEx eI 1 

□ 

Corollary 3.5.1. For all 5 sufficiently small, the flow z^it) restricted to the in- 
variant tori Ai c = {h = c} is ergodic (with respect to the invariant Lebesgue 
measure) for almost every ceW. 

Proof. The flow is ergodic whenever the periods T\ and T 2 are irrationally related. 
Fix 5 sufficiently small such that = —Q/E^ 2 + 0{8) < 0. Next, consider 
Q, W, and E 2 fixed, so that T 2 is constant. Because T% G C 1 , it follows that, 
as we let E\ vary, |f ^ Q for almost every E\. The result follows from Fubini's 
Theorem. □ 



Proof of Lemma I3.4.2t 

Proof. For the duration of this proof, we consider the dynamics for a small, fixed 
value of 5 > 0, which we generally suppress in our notation. For convenience, we 
take mi = 2. 

Let tj) denote the map taking (Q, W, qi,vi, q 2 , v 2 ) to (Q, W, E\, E 2 , <px, ip 2 ). We 
claim that ip is a C 1 change of coordinates on the domain of interest. Since E\ = 
v\ + Ks(qi) + K s(Q — qi), Ei is a C 2 function of q%, Vi, and Q. A similar statement 
holds for E 2 . 

The angular coordinates (fi(xi,Vi,Q) are defined by Equation (|3.12p . We only 
consider (fi, as the statements for <p 2 are similar. Then <pi(qi,vi,Q) is clearly C 1 
whenever qi ^ a, Q—a. The apparent difficulties in regularity at the turning points 
are only a result of how the definition of (pi is presented in Equation (13.121) . Recall 
that the angle variables are actually defined by integrating the elapsed time along 
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orbits, and our previous definition expressed ip x in a manner which emphasized 
the dependence on q x . In fact, whenever \v x \ < VE X , 

^■^-{ k + f . £{K -y {El _ v 2 )dv if5l> Q_i. < 3 ' 19 ) 

Here i?i is implicitly considered to be a function of q x ,v x , and Q. One can verify 
that Dip is non-degenerate on the domain of interest, and so ip is indeed a C 1 
change of coordinates. 

Next observe that d<p x ^/dt = 1/T X , so Hadamard's Lemma implies that 

It remains to show that, in fact, we may take f(S) = 1. It is easy to verify this 
whenever q x < Q — S because dE x /dt = there. We only perform the more difficult 
verification when qi > Q — 5. 

When q\ > Q — 5, \v x \ < VEi and E x = v \ + Kg(Q — qi). From Equation (13.191) 
we find that 



1 26 



To find difi/dt, we consider ip x as a function of v x , Q, and Ex, so that 

dip i dipi dvi dipi dQ d<p x dEi 
dt dv x dt dQ dt dE x dt 

Then, using Equations (13.181) and (13.201) . we compute 

d<pidvi 2 x , 2 ^'s(Q ~ gi) 1 

d^^F = } (El - V ^ 2 = T x ' 

fl^idg 1/2 - ^ flTi = ^1/2 - Vl 2 



<9Q dt t x dQ y ' t x y/Ei 

dip, dE x ( 1/2- ip x dT x 25 f v \ _ x 2 \ 

^1T = (— + {K )(E 1 -v)dvj(sW Ks (Q-q x )). 

Using that ^(Q - q x ) = k'{k- 1 {E x - vf))/5 = (5(k~ 1 )'(E x - vf))' 1 , we find that 



<9£i dt V <5 / \(«~ 1 )'( s i-' u i) Jo 

But here 1/2 — ip x is 0(5). See the proof of Lemma [3.4.31 below. Thus the claims 
about dip x /dt will be proven, provided we can uniformly bound 

^(K^y^-v^dv. 



{k- 1 )'{E x -vZ 
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r E[ is entirely due to 



Note that the apparent divergence of the integral as \v\\ 
the fact that our expression for ipi from Equation (13.201) requires < s/E\. If 
we make the substitution u = E\ — v 2 and let e = E\ — vf , then it suffices to show 
that 

1 



sup sup 

£<E 1 <n(0)-£ 0<e<E 1 



y/Ei-U 



rd,U 



< + OO. 



The only difficulties occur when e is close to 0. Thus it suffices to show that 

1 



sup sup 

£<£i<k(0)-£ 0<e<£/2 

is finite. But this is bounded by 



£/2 (k- 1 )»(u) 



Ve^ 



rdu 



u 



sup 

0<e<£/2 



sup 

0<e<£/2 



which is finite because (k 
be proven similarly. 



-ly/ 



-00 as e — > + . The claims about dif2/dt can 



□ 



Proof of Lemma I3.4.3t 



Proof. We continue in the notation of the proofs of Lemmas 13.4. II and 13.4.21 above, 
and we set ni\ = 2. Then from Equation (13.201) . we see that Kg(Q — qi) = unless 



ki-l/2| < 
is similar. 



25 
Ti 



^{^'{Ex-v^dv = 5F{E l )/T 1 = 0(8). Dealing with <p 2 



□ 
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Chapter 4 



The periodic oscillation of an 
adiabatic piston in two or three 
dimensions 

In this chapter, we present our results for the piston system in two or three di- 
mensions. These results may also be found in [Wri07j. 

4.1 Statement of the main result 
4.1.1 Description of the model 

Consider a massive, insulating piston of mass M that separates a gas container 
T> in R d , d = 2 or 3. See Figure 14.11 Denote the location of the piston by Q, 
its velocity by dQ/dt = V, and its cross-sectional length (when d — 2, or area, 
when d — 3) by £. If Q is fixed, then the piston divides T> into two subdomains, 
T^i{Q) — T^i on the left and T> 2 (Q) = T> 2 on the right. By E^ we denote the 
total energy of the gas inside and by |T>j| we denote the area (when d — 2, or 
volume, when d = 3) of 

We are interested in the dynamics of the piston when the system's total energy 
is bounded and M — >• oo. When M = oo, the piston remains fixed in place, and 
each energy E{ remains constant. When M is large but finite, MV 2 /2 is bounded, 
and so V = 0{M~ x l 2 ). It is natural to define 

e = M~ 1/2 , 




so that W is of order 1 as e — > 0. This is equivalent to scaling time by e. 
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M = e~ 2 > 1 




t I r 



Q 1 



Figure 4.1: A gas container Pel 2 separated by a piston. 

Next we precisely describe the gas container. It is a compact, connected billiard 
domain Pel' 1 with a piecewise C 3 boundary, i.e. &D consists of a finite number of 
C 3 embedded hyper surf aces, possibly with boundary and a finite number of corner 
points. The container consists of a "tube," whose perpendicular cross-section V is 
the shape of the piston, connecting two disjoint regions. V C is a compact, 
connected domain whose boundary is piecewise C 3 . Then the "tube" is the region 
[0, 1] x V C V swept out by the piston for < Q < 1, and [0, 1] x dV C dV. If 
d — 2, V is just a closed line segment, and the "tube" is a rectangle. If d — 3, V 
could be a circle, a square, a pentagon, etc. 

Our fundamental assumption is as follows: 

Main Assumption. For almost every Q G [0, 1] the billiard flow of a single 
particle on an energy surface in either of the two subdomains T>i(Q) is ergodic 
(with respect to the invariant Liouville measure). 

If d — 2, the domain could be the Bunimovich stadium |Bun79] . Another possible 
domain is indicated in Figure 14.11 The ergodicity of billiards in such domains, 
which produce hyperbolic flows, goes back to the pioneering work of Sinai [Sin70j . 
although a number of individuals have contributed to the theory. A full accounting 
of this history can be found in |CM0 6aj. Polygonal domains satisfying our assump- 
tions can also be constructed |Vor97] . Suitable domains in d = 3 dimensions can be 
constructed using a rectangular box with shallow spherical caps adjoined [BR98J. 
Note that we make no assumptions regarding the hyperbolicity of the billiard flow 
in the domain. 

The Hamiltonian system we consider consists of the massive piston of mass M 
located at position Q, as well as ni+n 2 gas particles, n\ in T>\ and n 2 in T> 2 . Here n\ 
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and n 2 are fixed positive integers. For convenience, the gas particles all have unit 
mass, though all that is important is that each gas particle has a fixed mass. We 
denote the positions of the gas particles in T>i by q^j, 1 < j < n^. The gas particles 
are ideal point particles that interact with dT> and the piston by hard core, elastic 
collisions. Although it has no effect on the dynamics we consider, for convenience 
we complete our description of the Hamiltonian dynamics by specifying that the 
piston makes elastic collisions with walls located at Q — 0, 1 that are only visible 



to the piston. We denote velocities by dQ/dt 
we set 



E i,j 



1,3 



A 



Ei 



V = eW and dq i: j/dt = Vi and 



Our system has d{rii + n 2 ) + 1 degrees of freedom, and so its phase space is 
(2d(n 1 + n 2 ) + 2)-dimensional. 
We let 



h(z) =h = (Q, W, E 1;1 , E 1;2 , ■■■ , E 1>ni ,E 2;1 , E 2>2 , 



E 



2,n, 2 , 



so that h is a function from our phase space to M. ni+n2+2 . We often abbreviate 
h = (Q, W, E 1 j, E 2 j), and we refer to h as consisting of the slow variables because 
these quantities are conserved when e = 0. We let h £ (t,z) = h e (t) denote the 
actual motions of these variables in time for a fixed value of e. Here z represents 
the initial condition in phase space, which we usually suppress in our notation. 
One should think of h e (-) as being a random variable that takes initial conditions 
in phase space to paths (depending on the parameter t) in M ni+n2+2 . 



4.1.2 The averaged equation 

From the work of Neishtadt and Sinai |NS04j . one can derive 



d 

dr 



Q ' 

w 



H{h) 



W 



2E x i 



2Eo 



d\V 2 (Q)\ 
2WE Xj " 



d 



2W 



2J< 



d\V 2 (Q)\ 



(4.1) 



as the averaged equation (with respect to the slow time r = et) for the slow 
variables. Later, in Section 14.2.31 we will give another heuristic derivation of the 
averaged equation that is more suggestive of our proof. 

Neishtadt and Sinai |Sin99t [NB04J pointed out that the solutions of Equa- 
tion fll.3p have (Q, W) behaving as if they were the coordinates of a Hamiltonian 
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system describing a particle undergoing motion inside a potential well. As in 
Section 11.21 the effective Hamiltonian is given by 

\_ w% EM\T>x{Qmf' d , ^(0)|P 2 (g(0))| 2/<f 
2 |Pi(Q)| 2/d \T> 2 (Q)\ 2/d 

This can be seen as follows. Since 

d\Vi(Q)\ gW^j 

d\n(Eij)/dT = -{2/d)dln{\Vi(Q)\)/dr, and so 

Pi(Q(0))|V /d 



By summing over j, we find that 

Ei(r) = Ei(0) 

and so 



MQ{t))\J ' 
^ (Q(Q))l V /d 



A(Q(r))i; 

rf 2 Q(r) = 2£E 1 (0)\V 1 (Q(0))\ 2/d _ 2£ E 2 (0) \V 2 (Q(0))\ 2/d 
dr 2 d |P 1 (Q(r))| 1+2/d d \V 2 (Q(r))\ 1+2/d 
Let h(r, z) = h(r) be the solution of 

dh - - 

— = H(h), h(0) = he(0). 
Again, think of h(-) as being a random variable. 



4.1.3 The main result 

The solutions of the averaged equation approximate the motions of the slow vari- 
ables, h £ (t), on a time scale 0(1/ e) as e — ► 0. Precisely, fix a compact set 
V C M"i+™2+ 2 such that ft G V Q CC (0,1), W CC R, and E itj CC (0, oo) 
for each % and We will be mostly concerned with the dynamics when h G V. 
Define 

Qmin hlf Q) Qmax SUpQ, 

heV heV 

E min = inf \w 2 + E l + E 2 , E max = sup \w 2 + E x + £ 2 - 
/iev z ^gv z 

1 We have introduced this notation for convenience. For example, h E V Q CC (0, 1) 
means that there exists a compact set A C (0, 1) such that h e V ^> Q e A, and similarly for 
the other variables. 
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For a fixed value of e > 0, we only consider the dynamics on the invariant subset 
of phase space defined by 

M £ = {(Q,V,q t „v tJ ) E MMm+n 2 )+2 . Q E [0)1]) % . G p . (g)j 

M 2 

^tnm < -^-V" + Ei + E 2 < E max \. 

Let P £ denote the probability measure obtained by restricting the invariant Liou- 
ville measure to M. e . Define the stopping time 

T £ (z) =T £ = inf{r > : h(r) £ V or h £ (r/e) £ V}. 

Theorem 4.1.1. IfT> is a gas container in d = 2 or 3 dimensions satisfying the 
assumptions in Subsection \4-l.l\ above, then for each T > 0, 

sup \h e {r/e) — h(r)\ — > in probability as e = M~ 1//2 — > 0, 

0<t<TAT e 

i.e. for each fixed S > 0, 

PA sup \he(r/e) - h(r)\ > 5 J -»• as e = M" 1/2 -> 0. 

\0<r<rAT £ / 

Remark 4.1.1. It should be noted that the stopping time in the above result is not 
unduly restrictive. If the initial pressures of the two gasses are not too mismatched, 
then the solution to the averaged equation is a periodic orbit, with the effective 
potential well keeping the piston away from the walls. Thus, if the actual motions 
follow the averaged solution closely for < r < T A T £ , and the averaged solution 
stays in V, it follows that T e > T. 

Remark 4.1.2. The techniques of this work should immediately generalize to prove 
the analogue of Theorem HXT] above in the nonphysical dimensions d > 3, although 
we do not pursue this here. 

Remark 4.1.3. As in Subsection 13.1.31 Theorem 14.1.11 can be easily generalized 
to cover a system of iV — 1 pistons that divide N gas containers, so long as, for 
almost every fixed location of the pistons, the billiard flow of a single gas particle 
on an energy surface in any of the N subcontainers is ergodic (with respect to the 
invariant Liouville measure). The effective Hamiltonian for the pistons has them 
moving like an (N — l)-dimensional particle inside a potential well. 

4.2 Preparatory material concerning a 

two-dimensional gas container with only one 
gas particle on each side 

Our results and techniques of proof are essentially independent of the dimension 
and the fixed number of gas particles on either side of the piston. Thus, we focus 
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Figure 4.2: A choice of coordinates on phase space. 

on the case when d = 2 and there is only one gas particle on either side. Later, 
in Section 14.41 we will indicate the simple modifications that generalize our proof 
to the general situation. For clarity, in this section and next, we denote q% } i by q\, 
v 2 ,i by v 2 , etc. We decompose the gas particle coordinates according to whether 
they are perpendicular to or parallel to the piston's face, for example qi = (q^, q\). 
See Figure 14.21 

The Hamiltonian dynamics define a flow on our phase space. We denote this 
flow by z s (t,z) = z e (t), where z = z £ (0,z). One should think of z e (-) as being 
a random variable that takes initial conditions in phase space to paths in phase 
space. Then h £ {t) = h(z E (t)). By the change of coordinates W = V/e, we may 
identify all of the M. e defined in Section H~T1 with the space 

M = {{Q,W,q 1 ,v 1 ,q 2 ,v 2 ) ER W :QE [0,1], q x E £>i(Q), q 2 E V 2 (Q), 

Emin < -W 2 + Ei + E 2 < E max }. 

and all of the P e with the probability measure P on A^, which has the density 

dP = const dQdWdq^dq\dvidv\dq 2 dq 2 dv2dv\. 

(Throughout this work we will use const to represent generic constants that are 
independent of e.) We will assume that these identifications have been made, so 
that we may consider z e (-) as a family of measure preserving flows on the same 
space that all preserve the same probability measure. We denote the components 
of z e {t) by Q e (t), q^ £ (t), etc. 

The set {z E M. : q\ = Q = q 2 } has co-dimension two, and so [J t z £ (t){q 1 = 
Q = q 2 } has co-dimension one, which shows that only a measure zero set of initial 
conditions will give rise to three particle collisions. We ignore this and other 
measures zero events, such as gas particles hitting singularities of the billiard flow, 
in what follows. 
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Now we present some background material, as well as some lemmas that will 
assist us in our proof of Theorem 14.1.11 We begin by studying the billiard flow 
of a gas particle when the piston is infinitely massive. Next we examine collisions 
between the gas particles and the piston when the piston has a large, but finite, 
mass. Then we present a heuristic derivation of the averaged equation that is 
suggestive of our proof. Finally we prove a lemma that allows us to disregard 
the possibility that a gas particle will move nearly parallel to the piston's face 
- a situation that is clearly bad for having the motions of the piston follow the 
solutions of the averaged equation. 

4.2.1 Billiard flows and maps in two dimensions 

In this section, we study the billiard flows of the gas particles when M = oo and 
the slow variables are held fixed at a specific value h G V. We will only study 
the motions of the left gas particle, as similar definitions and results hold for the 
motions of the right gas particle. Thus we wish to study the billiard flow of a point 
particle moving inside the domain T>\ at a constant speed \J2E\. The results of 
this section that are stated without proof can be found in [CM06a] . 

Let TT>i denote the tangent bundle to D\. The billiard flow takes place in 
the three-dimensional space A4\ = M 1 = {(<7i,i>i) G TT>\ : q\ G T>i, \v%\ = 
\/2Ei}/ ~. Here the quotient means that when q± G &D\, we identify velocity 
vectors pointing outside of T>\ with those pointing inside T>\ by reflecting through 
the tangent line to &D\ at q\, so that the angle of incidence with the unit normal 
vector to &D\ equals the angle of reflection. Note that most of the quantities 
defined in this subsection depend on the fixed value of h. We will usually suppress 
this dependence, although, when necessary, we will indicate it by a subscript h. 
We denote the resulting flow by y(t,y) = y(t), where y(0,y) = y. As the billiard 
flow comes from a Hamiltonian system, it preserves Liouville measure restricted 
to the energy surface. We denote the resulting probability measure by fi. This 
measure has the density d/j, = dq\dv i/(2iiy / 2Ei \T>x\). Here dq\ represents area on 
M 2 , and dv\ represents length on S^ET = [yi G M 2 : |t>i| = \ / 2E 1 }. 

There is a standard cross-section to the billiard flow, the collision cross-section 
Q = {(qi,Vi) G TV\ : q\ G dT>i, \v\\ = y/2Ei}/ ~. It is customary to parameter- 
ize Q by {x = (r,ip) : r G &Di, (p G [— 7r/2, +7r/2]}, where r is arc length and ip 
represents the angle between the outgoing velocity vector and the inward pointing 
normal vector to <9T>i. It follows that Q may be realized as the disjoint union 
of a finite number of rectangles and cylinders. The cylinders correspond to fixed 
scatterers with smooth boundary placed inside the gas container. If F : Q O is the 
collision map, i.e. the return map to the collision cross- sect ion, then F preserves the 
projected probability measure u, which has the density dv = cos ip dip dr / (2 |9£>i|). 
Here | &D\ \ is the length of &V\ . 
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We suppose that the flow is ergodic, and so F is an invertible, ergodic mea- 
sure preserving transformation. Because &D\ is piecewise C 3 , F is piecewise C 2 , 
although it does have discontinuities and unbounded derivatives near discontinu- 
ities corresponding to grazing collisions. Because of our assumptions on T>i, the 
free flight times and the curvature of &D\ are uniformly bounded. It follows that 
if x ^ dCl U F~ 1 (dd), then F is different iable at x, and 

\\DF(x)\\< CQI ; S * (4.2) 

where <p(Fx) is the value of the <p coordinate at the image of x. 

Following the ideas in Section 14.51 we induce F on the subspace Cl of Cl cor- 
responding to collisions with the (immobile) piston. We denote the induced map 
by F and the induced measure by v. We parameterize Cl by {(r, (p) : < r < 
£, E [— 7r/2, +7r/2]}. As vCl = £/\dT>i\, it follows that v has the density 
dv = cos ipdipdr/ (2£) . 

For i6 0, define (x to be the free flight time, i.e. the time it takes the billiard 
particle traveling at speed yj2E\ to travel from x to Fx. If x ^ <9f2 U F _1 (9fi), 

Santalo's formula |San76t IChe97j tells us that 

EX = r^TT- (4-4) 



If C, : Cl — > R is the free flight time between collisions with the piston, then it 
follows from Proposition 14.5.11 that 

The expected value of lun when the left gas particle collides with the (immo- 
bile) piston is given by 



/2fi r+ir/2 

Efi\v^\=E P y/2E^coa(p=- -/ cos 2 <p dtp = y/2E^-. (4.6) 



We wish to compute lim^oot -1 J* Q |2^(s)| 5 q ±^ = gds, the time average of the 
change in momentum of the left gas particle when it collides with the piston. If this 
limit exists and is equal for almost every initial condition of the left gas particle, 
then it makes sense to define the pressure inside V 1 to be this quantity divided 
by £. Because the collisions are hard-core, we cannot directly apply Birkhoff's 
Ergodic Theorem to compute this limit. However, we can compute this limit by 
using the map F . 
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Lemma 4.2.1. If the billiard flow y(t) is ergodic, then for [l — a.e. y G M 1 , 



lim - 

t^oo t 



K ( s )\ ^ q ^{s)=Qds 



2\V 1 {Q)y 



Proof. Because the billiard flow may be viewed as a suspension flow over the 
collision cross-section with ( as the height function, it suffices to show that the 
convergence takes place for v — a.e. x G Cl. For an initial condition x G Cl, define 
Nf(x) = Nt — 4r~ \ s ^ (0, t] : y(s,x) G Cl >. By the Poincare Recurrence Theorem, 



N t 



oo as t — > oo, v — a.e. 



But 



ZtoC(F"x 



i N * 1 r t 

■^y^KI (P n x) < - / \vi 



< 



N. 



Nt 



and so the result follows from Birkhoff's Ergodic Theorem and Equations 04 .5 1) 
and gU). □ 

Corollary 4.2.2. If the billiard flow y(t) is ergodic, then for each 5 > 0, 



{1 /"* E i 

V^M 1 : - J \vi(s)\5 q ± (s)=Q ds - —p - 1 

4.2.2 Analysis of collisions 



(Q)\ 



> s 



as t — > oo. 



In this section, we return to studying our piston system when e > 0. We will 
examine what happens when a particle collides with the piston. For convenience, 
we will only examine in detail collisions between the piston and the left gas particle. 
Collisions with the right gas particle can be handled similarly. 

When the left gas particle collides with the piston, v i and V instantaneously 
change according to the laws of elastic collisions: 





1 


"1 - M 2M ' 




K1 


v + 


1 + M 


2 M - 1 




v- 



In our coordinates, this becomes 





1 


~e 2 - 1 2e 


w + 


1 +e 2 


2e l-e 2 





K1 







(4.7) 
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Recalling that Vi, W = 0(1), we find that to first order in e, 



v 



+ 



vf- + 0(s) 



W + 



W~ + 0{e). 



(4.8) 



Observe that a collision can only take place if v ± > eW~ . In particular, v 1 _ > 
—ey/2E max . Thus, either > or = 0(e). By expanding Equation (14.71) 
to second order in e, it follows that 



Note that it is immaterial whether we use the pre-collision or post-collision values 
of W and pn on the right hand side of Equation (14.91) . because any ambiguity 
can be absorbed into the 0(e 2 ) term. 

It is convenient for us to define a "clean collision" between the piston and the 
left gas particle: 

Definition 4.2.1. The left gas particle experiences a clean collision with the 
piston if and only if > and < —£\/2E max . 

In particular, after a clean collision, the left gas particle will escape from the 
piston, i.e. the left gas particle will have to move into the region < before 
it can experience another collision with the piston. It follows that there exists 
a constant C\ > 0, which depends on the set V, such that for all e sufficiently 
small, so long as Q > Q m in and \v^ \ > sC\ when e [Qmin,Q], then the 
left gas particle will experience only clean collisions with the piston, and the time 
between these collisions will be greater than 2Q min J (^2E max ). (Note that when we 
write expressions such as q± G [Q m i n ,Q], we implicitly mean that qi is positioned 
inside the "tube" discussed at the beginning of Section |4~T1 ) One can verify that 
C\ — 5y / 2E max would work. 

Similarly, we can define clean collisions between the right gas particle and 
the piston. We assume that C\ was chosen sufficiently large such that for all e 
sufficiently small, so long as Q < Q ma x and \v%\ > eC\ when q^ G [Q, Qmax], then 
the right gas particle will experience only clean collisions with the piston. 

Now we define three more stopping times, which are functions of the initial 
conditions in phase space. 



T' £ =inf{r > : Q min < qf^r/e) < Q e {t/e) < Q max and |^ £ (r/e)| < C x e}, 
T'J =inf{r > : Q min < Q £ {r/e) < q^r/e) < Q max and \v^{t/s)\ < de}, 
f £ =T A T e A T' £ A T" 




(4.9) 
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Define H(z) by 



H(z) 



W 

+2\vt\5 qt=Q -2\vi\5 qi 
-2W\vi\5 qi=Q 
+2W\v 2 L \5 q , =Q 



Here we make use of Dirac delta functions. All integrals involving these delta 
functions may be replaced by sums. 

The following lemma is an immediate consequence of Equation (14. 9p and the 
above discussion: 

Lemma 4.2.3. If < t± < t 2 < T £ /e, the piston experiences 0((t 2 — ii) V 1) 
collisions with gas particles in the time interval [ti,t 2 ], all of which are clean 
collisions. Furthermore, 



h e (t 2 )-h e (t 1 ) = 0(e)+e I' H{z E { 

Jh 



s))ds. 



Here any ambiguities arising from collisions occurring at the limits of integration 
can be absorbed into the 0(e) term. 



4.2.3 Another heuristic derivation of the averaged equa- 
tion 



The following heuristic derivation of Equation (14. ip when d = 2 was suggested 
in [Dol05] . Let At be a length of time long enough such that the piston experiences 
many collisions with the gas particles, but short enough such that the slow variables 
change very little, in this time interval. From each collision with the left gas 
particle, Equation (14.91) states that W changes by an amount +2e \v^\ + 0(e 2 ), 
and from Equation H4.6[) the average change in W at these collisions should be 
approximately en^2Ei/2 + 0{e 2 ). From Equation (14. 5 p the frequency of these 
collisions is approximately y/2E\ £/(tt \T>i\). Arguing similarly for collisions with 
the other particle, we guess that 



AW 



— £; 



At |2MQ)| \V 2 (Q)\ 



With r = et as the slow time, a reasonable guess for the averaged equation for W 

dW Ei£ _ E 2 i 
~dT~ |XMQ)| ~ \v 2 {Q)\' 
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Similar arguments for the other slow variables lead to the averaged equation (14. ip . 
and this explains why we used Pj = E^j for the pressure of a 2-dimensional 
gas in Section 11.21 

There is a similar heuristic derivation of the averaged equation in d > 2 dimen- 
sions. Compare the analogues of Equations H4.5[) and (14.61) in Subsection 14.4.21 



4.2.4 A priori estimate on the size of a set of bad initial 
conditions 

In this section, we give an a priori estimate on the size of a set of initial conditions 
that should not give rise to orbits for which sup 0<T<TATe \h e (r/e) — h(r)\ is small. 
In particular, when proving Theorem 14.1.11 it is convenient to focus on orbits that 
only contain clean collisions with the piston. Thus, we show that P{T £ < T A T £ } 
vanishes as e — > 0. At first, this result may seem surprising, since P{T' £ A T" = 
0} = 0(e), and one would expect uJ=qZ £ (— t){T' £ A T" = 0} to have a size of order 
1. However, the rate at which orbits escape from {T' £ A T £ = 0} is very small, and 
so we can prove the following: 

Lemma 4.2.4. 

P{f £ <TAT £ } = 0(e). 

In some sense, this lemma states that the probability of having a gas particle 
move nearly parallel to the piston's face within the time interval [0, T/e], when one 
would expect the other gas particle to force the piston to move on a macroscopic 
scale, vanishes as e — > 0. Thus, one can hope to control the occurrence of the 
"nondiffusive fluctuations" of the piston described in |CD06a] on a time scale 
0(e- 1 )- 

Proof. As the left and the right gas particles can be handled similarly, it suffices 
to show that P{T' £ < T} = O(e). Define 

<B £ = {z G M : Qmin <qi <Q < Qmax and |^| < de}. 
' /£ z e (-t)*B e , and if 7 = Q min j \/8E max , 

(T/e \ / T/e 

\t=0 J \ t=0 

'T/(ey) 




<P*B e + P |J z^) [U(^(0« e )\Q5 £ 
y fe=o t=o 

< PK £ + (J- + lj P ( U(z £ (t)Q3 £ )\Q5 £ 
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Now P93 e = 0(e), so if we can show that P (\J} =0 (z e (t)VB e )\<B e ) = ( £2 )> then [t 
will follow that P{T £ ' < T} = 0(e). 

If 2 £ jJJl (z e (t)58 e )\Q3 e , ^ i s s ^iH true that Iv^l = 0(e). This is because 
changes by at most O(e) at the collisions, and if a collision forces \v±\ > C\E 1 then 
the gas particle must escape to the region < before v i can change again, and 
this will take time greater than 7. Furthermore, if z £ UJ =0 (2: e (i)Q5 e )\23 e , th en a t 
least one of the following four possibilities must hold: 

• |<7l" - Qmin\ < 0(e), 

• \Q-Qmin\ < 0(e), 

• \Q-QmaA < 0(S), 

• \Q-qi\<0{e). 

It follows that P ({Jl =0 (z £ (t)<B £ )\<B £ ) = 0(e 2 ). For example, 



M 



1 , , , r]P 

{K <0(e), \li-Qmin \<0{e)} ar 



const / lri ±| <0 f e) \dWdv 1 dv\dv^ dv\ 

{E mln <W^/2+vl/2+vl/2<E max } 1 1 

x / 1 { . t _ Qmin ^ 0{£)} dQdqidqldq^dql 



{Q6[o,i],9iex>i, g2 ex> 2 } 



□ 



4.3 Proof of the main result for two-dimensional 
gas containers with only one gas particle on 
each side 

As in Section 14.21 we continue with the case when d = 2 and there is only one gas 
particle on either side of the piston. 

4.3.1 Main steps in the proof of convergence in probability 

By Lemma |4.2.4[ it suffices to show that sup 0<r< j, £ \h £ (r/e) — h(r)\ — > in prob- 
ability as e = M~ l l 2 — > 0. Several of the ideas in the steps below were inspired by 
a recent proof of Anosov's averaging theorem for smooth systems that is due to 
Dolgopyat |Dol05| . 
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Step 1: Reduction using Gronwall's Inequality. Observe that h{r) satisfies 
the integral equation 

h( T ) - h(0) = [ H(h(a))da, 
Jo 

while from Lemma 14.2.31 

rr/e 



h £ (r/s)-h £ (0) = O(e)+e [ * H(z £ (s))ds 

Jo 



0(e)+e [ ' H(z £ (s))-H(h £ (s))ds + [ H(h £ (a/e))da 
Jo Jo 



for < r < T £ . Define 



e £ (r) =e [ ^ H(z £ (s)) - H(h £ (s))ds. 
Jo 

It follows from Gronwall's Inequality that 

sup \h £ (r/e) -h(r)\ < ( 0(e) + sup |e £ (r)| ) e Lip ( 5|v ) T . (4.10) 

Gronwall's Inequality is usually stated for continuous paths, but the standard 
proof (found in [SV85J) still works for paths that are merely integrable, and 
\h £ (r/e) — h(r)\ is piecewise smooth. 

Step 2: Introduction of a time scale for ergodization. Let L(e) be a 
real valued function such that L(e) — > oo, but L(e) \oge~ 1 , as e — > 0. In 
Section T4.3.2I we will place precise restrictions on the growth rate of L(e). Think 
of L(e) as being a time scale that grows as e —> so that ergodization, i.e. the 
convergence along an orbit of a function's time average to a space average, can 
take place. However, L(e) doesn't grow too fast, so that on this time scale z £ (t) 
essentially stays on the submanifold {h = h £ (0)}, where we have our ergodicity 
assumption. Set tk >£ = kL(e), so that 



sup |e e (r)| < 0{eL(e)) + e 

0<T<f e k=Q 



fc+1 ' £ H{z E {s)) - H(h £ (s))ds 



(4.11) 



Step 3: A splitting according to particles. Now H(z) — H(h(z)) divides 
into two pieces, each of which depends on only one gas particle when the piston 
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is held fixed: 



H(z)-H(h(z)) 







2 \ V l\ &qi=Q lUifQll 

-2W\vt\5 q , =Q + ^ 




+ 



E 2 e 
\V 2 (Q)\ 



-2\v±\6. 



2 | <V=Q 







P2(Q)| 



2 I ^ 



We will only deal with the piece depending on the left gas particle, as the right 
particle can be handled similarly. Define 



G(z) = \vi\S q ± 



G{h) 



2|Pi(Q)|' 



(4.12) 



Returning to Equation (14. lip , we see that in order to prove Theorem 14.1.11 it 
suffices to show that both 



T E 

E 

k=0 

T £ 
zL(e) 

E 

k=0 



G(z E (s))-G{h E (s))ds 



and 



W £ (s)(G(z £ (s))-G(h £ (s)))ds 



tk.t 



converge to in probability as e — > 0. 



Step 4: A splitting for using the triangle inequality. Now we let z k>£ (s) 

be the orbit of the e = Hamiltonian vector field satisfying Zk, s (tk,e) = z e(tk,e)- 
Set hk, £ (f) = h(zk,e(t))- Observe that hk, £ (t) is independent of t. 

We emphasize that so long as < t < T £ /e, the times between collisions of a 
specific gas particle and piston are uniformly bounded greater than 0, as explained 
before Lemma 14.2.31 It follows that, so long as %+i, e < T £ /e, 

sup \h Ke {t)-h £ {t)\=0{eL{e)). (4.13) 

This is because the slow variables change by at most 0(e) at collisions, and 
dQ £ /dt = 0(e). 
Also, 

[ tk+1 ' £ W £ (s){G(z £ (s))-G(h £ (s)))ds 

J tk,e 

= 0(eL(s) 2 ) + W kj£ (t kjE ) G(z £ (s)) - G(h £ (s))ds, 
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and so 



E 

fc=0 



'fc+1,6 



W £ (s)(G(z £ (s))-G(h £ (s)))ds 



< 0(eL(e)) + e const ^ 

fc=0 



*fe+l,e 



G(z e (s)) - G{h £ {s))ds 



Thus, in order to prove Theorem 14.1.14 h suffices to show that 



sL(s) 



E 

fc=0 



tk + l, 



G(z E {s)) - G(h £ (s))ds 



T E 
eL(e) " 



fc=0 



converges to in probability as e — > 0, where 



■k+l,s 



G{z E (s)) - G(z k , £ (s))ds, 



■k+l,s 



G(z k>£ (s)) - G(h kt£ (s))ds, 



■k+l,s 



G{h k>£ {s)) - G{h £ {s))ds. 



The term II k)£ represents an "ergodicity term" that can be controlled by our 
assumptions on the ergodicity of the flow Zo(t), while the terms I kt£ and III k , £ 
represent "continuity terms" that can be controlled by controlling the drift of 
z £ (t) from z k;£ (t) for t k)£ <t< t k+li£ . 

Step 5: Control of drift from the e = orbits. Now G is uniformly 
Lipschitz on the compact set V, and so it follows from Equation (14.131) that 



III K£ = 0{eL{e) 2 ). Thus, e 



k=0 



\III k)£ \ = 0(eL(e)) -> as e -> 0. 



Next, we show that for fixed 5 > 0, P ye Ylk=o \^,e\ > $ J — > as e — > 0. 
For initial conditions zeM and for integers k G [0,T/(eL(e)) — 1] define 



fr~l 



A. 



k,e 



^ : , ; ; lifcel > ^= and < 



L(e 

A z>£ = {k: z E A k>£ } 



2T 



eL(e) 



Think of these sets as describing "poor continuity" between solutions of the e = 
and the e > Hamiltonian vector fields. For example, roughly speaking, z G A k , £ 
if the orbit z £ {i) starting at z does not closely follow z k)£ (t) for t k)£ <t< t k+ \ t£ . 
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One can easily check that \Ik, s \ < 0(L{e)) for k < T e /(eL(e)) — 1, and so it 
follows that 



_Zk i 

~ L ' : § 



e \h,e\<7; + 0(eL(e)#(A z , £ )). 



k=0 



Therefore it suffices to show that P (#(A Zt£ ) > 5(const eL(e)) x ) — > as e —* 0. 
By Chebyshev's Inequality, we need only show that 



Ep(eL{s)#{A z , e ))=eL{e) ^ P(A k , £ ) 



k=0 

tends to with e. 

Observe that z E (tk >£ )Ak, £ C Aq >£ . In words, the initial conditions giving rise to 
orbits that are "bad" on the time interval [tk, £ , £fc+i,e]> moved forward by time tk >e , 
are initial conditions giving rise to orbits which are "bad" on the time interval 
[to,e,ti,e\- Because the flow z s {-) preserves the measure, we find that 

T 1 



eL{e) p (Ak,e) < const P(Ao, £ ). 



k=0 

To estimate P(Ao, £ ), it is convenient to use a different probability measure, 
which is uniformly equivalent to P on the set {z G M. : h(z) G V} D {T £ > eL(e)}. 
We denote this new probability measure by P* , where the / stands for "factor." 
If we choose coordinates on M. by using h and the billiard coordinates on the two 
gas particles, then P? is defined on M. by dP^ = dhdfj^d/j^, where dh represents 
the uniform measure on V C M 4 , and the factor measure d\i l h represents the in- 
variant billiard measure of the i th gas particle coordinates for a fixed value of the 
slow variables. One can verify that l{h(z)ev}dP < const dPf, but that Pf is not 
invariant under the flow z e (-) when e > 0. 

We abuse notation, and consider fi\ to be a measure on the left particle's initial 
billiard coordinates once h and the initial coordinates of the right gas particle are 
fixed. In this context, fi\ is simply the measure /i from Subsection 14.2.11 Then 



P f (Ao,e) 

< / dhdn\-[i\ 



L(e) 



L(e) 

G{z £ {s))-G{z {s))ds 



> A and sL{e) < f £ 



and we must show that the last term tends to with e. By the Bounded Con- 
vergence Theorem, it suffices to show that for almost every h G V and initial 
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condition for the right gas particle, 



1 



G{z £ {s)) - G(z (s))ds 



> ^ and eL(e) < f £ 



as e — > 0. 



m J0 

(4-14) 

Note that if G were a smooth function and z £ (-) were the flow of a smooth 
family of vector fields Z(z,e) that depended smoothly on e, then from Gronwall's 
Inequality, it would follow that sup < t < L(s) \z £ {t) - z (t)\ < 0{eL(e)e u ^ z)L ^ £) ). If 

this were the case, then L{e)~ l j^ e) G{z £ {s)) - G{z {s))ds = 0{eL{e)e Up ^ L ^), 
which would tend to with e. Thus, we need a Gronwall-type inequality for billiard 
flows. We obtain the appropriate estimates in Section 14.3.21 



Step 6: Use of ergodicity along fibers to control Ih, e - 



to be shown is that for fixed 5 > 0, P ye Ylk=o \Hk,e\ > 8 

For initial conditions z G M. and for integers k G [0, T/(eL(e)) — 1] define 



All that remains 
• as e — > 0. 



B 7F 



1 



L{e r 
{k-.ze B Ke } . 



I Ik J > 



_5_ 

2T 



and k < 



eL{e) 



Think of these sets as describing "bad ergodization." For example, roughly speak- 
ing, z G Bk, £ if the orbit z £ {t) starting at z spends the time between tk, e and tk+i, E 
in a region of phase space where the function G(-) is "poorly ergodized" on the 
time scale L(e) by the flow z (t) (as measured by the parameter 5/2T). Note that 



<5 g ± = Q is not really a function, but that we may still speak of the 



G(z) = I 

convergence of t -1 J* G(z (s))ds as t — > oo. As we showed in Lemma 14.2.11 the 
limit is G(h ) for almost every initial condition. 

Proceeding as in Step 5 above, we find that it suffices to show that for almost 
every h G V, 

,-t 



G(z {s))ds - G(h (0)) 



> —— )■ — ► as t — > oo. 
~ 2T 



But this is simply a question of examining billiard flows, and it follows immediately 
from Corollary 14.2.21 and our Main Assumption. 



4.3.2 A Gronwall-type inequality for billiards 

We begin by presenting a general version of Gronwall's Inequality for billiard 
maps. Then we will show how these results imply the convergence required in 
Equation (14. 14ft . 
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Some inequalities for the collision map 

In this section, we consider the value of the slow variables to be fixed at ho G V. 
We will use the notation and results presented in Section 14.2.11 but because the 
value of the slow variables is fixed, we will omit it in our notation. 

Let p, 7, and A satisfy 0<p^C7^Cl^cA<oo. Eventually, these quantities 
will be chosen to depend explicitly on e, but for now they are fixed. 

Recall that the phase space Q for the collision map F is a finite union of 
disjoint rectangles and cylinders. Let d(-, •) be the Euclidean metric on connected 
components of Q. If x and x' belong to different components, then we set d(x, x') = 
oo. The invariant measure v satisfies v < const • (Lebesgue measure). For Acfl 
and a > 0, let Af a (A) = {i 6 : d(x, A) < a} be the a-neighborhood of A. 

For x G Q let Xk(x) = Xk = F h x, k > 0, be its forward orbit. Suppose x C lt \, 
where 

c 7 , A = (uj^F- k Mr(asi)) D^F-^iF-^idn))). 

Thus for < k < A, Xk is well defined, and from Equation ( 14. 2 j) it satisfies 

const 

d(x', x k ) < 7 => d(Fx',x k+1 ) < d{x',x k ). (4.15) 

7 

Next, we consider any p-pseudo-orbit x' k obtained from x by adding on an 
error of size < p at each application of the map, i.e. d(x' Q ,xo) < p, and for k > 1, 
^(x^, Fx^J < p. Provided d(xj, x'A < 7 for each j < k, it follows that 

^ f "\ ^ f const ^\ 

d{x k ,x' k ) < y— — J < const p^——J . (4.16) 

In particular, if p, 7, and A were chosen such that 

/ const \ A . _ „. 

const pi I < 7, (4.17) 



then Equation (I4.16P will hold for each k < A. We assume that Equation (14.171) 
is true. Then we can also control the differences in elapsed flight times using 
Equation (Oil : 



1 . , » 1 const p / const V , , 

\(x k -(x' k \< H -[ . (4.18) 

7 V 7 / 

It remains to estimate the size vC 1 ^\ of the set of x for which the above estimates 
do not hold. Using Lemma [4.3. II below. 

vC^ x < (A + l)(uM y (dQ) + vN^F^N^dn))) < C(A( 7 + 7 1/3 )) = C(A 7 1/3 ). 

(4.19) 
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Lemma 4.3.1. As 7 — > 0, 

vU y {F- l N y {dSl)) = o( 7 1/3 ). 

This estimate is not necessarily the best possible. For example, for dispersing 
billiard tables, where the curvature of the boundary is positive, one can show that 
i / Af 1 (F~ 1 Af 1 (dQ)) = C(7). However, the estimate in Lemma 14.3.11 is general and 
sufficient for our needs. 

Proof. First, we note that it is equivalent to estimate h>J\f 1 (FJ\L / (dft)), as F has 
the measure-preserving involution X(r, </?) = (r, —<p), i.e. F^ 1 = Xo FoX [CM06bJ. 

Fix a G (0, 1/2), and cover J\f 7 (dfl) with 0(^~ l ) starlike sets, each of diameter 
no greater than 0(7). For example, these sets could be squares of side length 7. 
Enumerate the sets as {Ai}. Set Q = {i : FAi fl Af^(dQ) = 0}. 

If i G Q, F\Ai is a diffeomorphism satisfying HDF^JI < 0(^~ a ). See Equa- 
tion (J32J). Thus diameter (FAi) < 0(^~ a ), and so 

diameter (Af^FAA) < 0(^- a ). 

Hence vN^FAA < O^ 1 " )), and vN^^gFAi) < 0(^~ 2a ). 

If i ^ Q, Ai fl F~ 1 (M 1 c (dVt)) 7^ 0. Thus Ai might be cut into many pieces 
by F~ l (dVt), but each of these pieces must be mapped near dfl. In fact, FAi C 
Afo^)(dtt). This is because outside F-\Mj«(d£l)), \\DF\\ < C(7" a ), and so 
points in FAi are n o more than a distance 0(7/7°) away from f\f 7 a(d£i), and 7 < 
y-a < 7 « It f n ows that Ny(FAi) C 7V 0(7a) (9fi), and i/W 0(7a )(<9ft) = 0(7°). 

Thus uN'^(F~ 1 N' 1 (dQ)) = 0( , ~f 1 ~ 2a + 7°), and we obtain the lemma by taking 
a = 1/3. 

□ 

Application to a perturbed billiard flow 

Returning to the end of Step 5 in Section l4\3.1[ let the initial conditions of the slow 
variables be fixed at h = (Q , Wo, £1,0, -£"2,0) £ V throughout the remainder of this 
section. We can assume that the billiard dynamics of the left gas particle in T>i(Q ) 
are ergodic. Also, fix a particular value of the initial conditions for the right gas 
particle for the remainder of this section. Then z £ (t) and T £ may be thought of as 
random variables depending on the left gas particle's initial conditions y G Ai 1 . 
Now if h £ (t) = (Q e (t), W e (t), Ei jE (t), E 2>e (t)) denotes the actual motions of the slow 
variables when e > 0, it follows from Equation (I4.13P that, provided eL(e) < T e , 

sup \ho - h £ (t)\ = 0(sL(e)). (4.20) 

0<t<L(e) 
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Furthermore, we only need to show that 



plyeM 1 : 



L{e 



' G(z £ (s)) - G(z (s))ds 







> — and sL{s) < T £ } - 



(4.21) 

as e — > 0, where G is defined in Equation (I4.12p . 

For definiteness, we take the following quantities from Subsection 14.3.21 to de- 
pend on e as follows: 

L(e) — L— log log -, 

7 (e) = 7 = e~ L , 

p\e) — p — const — . 

7 

The constant in the choice of p and p's dependence on e will be explained in the 
proof of Lemma r4.3.31 which is at the end of this subsection. The other choices may 
be explained as follows. We wish to use continuity estimates for the billiard map 
to produce continuity estimates for the flow on the time scale L. As the divergence 
of orbits should be exponentially fast, we choose L to grow sublogarithmically in 
e^ 1 . Since from Equation (I4.4p the expected flight time between collisions with 



&Di(Qo) when e = is E u ( = it |X>i(Qo)| /(y2i^o \9T>i(Qo)\), we expect to see 
roughly A/2 collisions on this time scale. Considering A collisions gives us some 
margin for error. Furthermore, we will want orbits to keep a certain distance, 
7, away from the billiard discontinuities. 7 — > as e — >• 0, but 7 is very large 
compared to the possible drift 0{sL) of the slow variables on the time scale L. In 
fact, for each C, m, n > 0, 



0(ee constL ) ^0 ase^O. (4.23) 



Let X : A4 1 — > f2 be the map taking e to x = X(y) e f2, the location 
of the billiard orbit of y in the collision cross-section that corresponds to the most 
recent time in the past that the orbit was in the collision cross-section. We consider 
the set of initial conditions 

£ £ = x-\n\c^) fix- 1 \xen-. Y,C{F k x) > l\ . 

I fc=0 J 

Now from Equations (I4.19P and f)4.22p . z/C 7i a — > as e — > 0. Furthermore, by the 
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ergodicity of F, 

vlxeVl :J2C(F k x) < l\ = v\x G Q : A _1 ^C(^) < ^0 

I k=0 J I fc=0 J 

as e — > 0. But because the free flight time is bounded above, /iX _1 < const -v, and 
so fi£ £ — > 1 as e — > 0. Hence, the convergence in Equation (I4.2ip and the conclusion 
of the proof in Section 14.3. II follow from the lemma below and Equation (14. 23ft . 

Lemma 4.3.2 (Analysis of deviations along good orbits). As e — > 0, 



sup 

ye£ s n{eL<f £ } 



G(z £ (s)) - G(z (s))ds 



( const\ ' 



0(P(-H )+O(L- 1 )-,0. 



Proof. Fix a particular value of y G £ e fl jeL < T e |. For convenience, suppose that 

y = X(y) = x G Q. Let ?/o(^) denote the time evolution of the billiard coordinates 
for the left gas particle when e — 0. Then there is some N < A such that the 
orbit Xk = F k x = (r^, ipk) for < k < N corresponds to all of the instances 
(in order) when yo(t) enters the collision cross-section Q = Qh corresponding to 
collisions with dT>i(Qo) for < t < L. We write Qh to emphasize that in this 
subsection we are only considering the collision cross-section corresponding to the 
billiard dynamics in the domain T>i(Qq) at the energy level -Ea,o- In particular, F 
will always refer to the return map on Qh . 

Also, define an increasing sequence of times tk corresponding to the actual 
times yo(t) enters the collision cross-section, i.e. 

to = 0, 

t k = t k _i + Cx fc _i for k > 0. 
Then x k = yo(tk)- Furthermore, define inductively 

N± = inf {k > : t k corresponds to a collision with the piston} , 
Nj = inf {k > Nj-i : t k corresponds to a collision with the piston} . 

Next, let y e (t) denote the time evolution of the billiard coordinates for the left 
gas particle when e > 0. We will construct a pseudo-orbit x' ke = (r ke ,(f' kE ) of 
points in fi^ that essentially track the collisions (in order) of the left gas particle 
with the boundary under the dynamics of y e (t) for < t < L. 

First, define an increasing sequence of times t' ke corresponding to the actual 
times y £ (t) experiences a collision with the boundary of the gas container or the 
moving piston. Define 

N' £ = sup {k > : t' k£ < L] , 

N[ e = inf {A; > : t' k e corresponds to a collision with the piston} , 

N's = inf {A; > iV'-_ le : t' ke corresponds to a collision with the piston} . 
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Because L < T £ (y)/e, we know that as long as A^- +1 < N' e , then N'- +1 ^ £ — N'a £ > 2. 
See the discussion in Subsection 14.2.21 Then we define x' ke G Qh by 

x , = U(t' k , £ ) if * i W, £ } , 

k ' £ l/^W ^ ke {N' j;E } . 

Lemma 4.3.3. Provided e is sufficiently small, the following hold for each k G 
[0,NAN' e ). Furthermore, the requisite smallness of e and the sizes of the constants 
in these estimates may be chosen independent of the initial condition y G £ £ R 
< T e | and of k: 

(a) x' k e is well defined. In particular, if k £ |iVj e } ; y £ (t' k £ ) corresponds to a 
collision point on &Di(Q ), and not to a collision point on a piece of &D to 
the right of Q$. 

(b) lfk>0 andk £ {N' j£ }, then x' Ke = Fx' k _ l£ . 

(c) If k > and k G {Nj £ j, then d(x' k£ ,Fx' k _ l£ ) < p and the ip coordinate of 
y £ it' k £ ) satisfies <p(y £ (t' k)E )) = <p' k)E + 0(e). 

(d) d(xk, x' k ) < const p(const/ r y) k . 

(e) k = Nj £ if and only if k = Nj . 

(f) lfk>0, t' k>£ - t' k _ 1>£ = t k - t fc _! + 0(p {constat). 

We defer the proof of Lemma 14.3.31 until the end of this subsection. Assuming 
that e is sufficiently small for the conclusions of Lemma 14.3.31 to be valid, we 
continue with the proof of Lemma 14.3.21 

Set M = N A N £ - 1. Note that M < A ~ L. From (f) in Lemma ET3T31 and 
Equations fQ2"]) and (T4~23l) . we see that 

M 

\tM - t'M,e\ ^ ^ V k > £ ~ ~~ ^ k ~~ ^-i) 

k=l 

Because the flight times t' k e — t' k _ le and t k — t k _i are uniformly bounded above, 
it follows from the definitions of A" and A^ that tu, t' Me > L — const. But from 
Subsection 14.2.21 the time between the collisions of the left gas particle with the 
piston are uniformly bounded away from zero. Using (c) and Equation (14.201) . it 



, , const A . 

O ( p j— ) -> as e -> 0. 
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follows that 



G{z e {s)) - G{z {s))ds 



k&{Nj:Nj<M} 

0{L- X ) + | V^fi cos Vk - V^E^o cos <p' k>e + 0{eL) 

ke{Nj:Nj<M} 

OiL- 1 ) + 0[eL 2 ) + ^ |cos^-cos^ )£ |. 

kelNj-.NjKM} 



But using (d), 

AT 

|cos<^ - cosy/ fc J < ^C(p(const/7) fc ) = £>(p(const/7) A ). 

ke{Nf.Nj<M} k=0 

Since eL 2 = (9(p(const/7) A ), this finishes the proof of Lemma [4.3.21 

□ 



Proof of Lemma \4-3.3\ The proof is by induction. We take e to be so small that 
Equation (I4.17P is satisfied. This is possible by Equation (14.231) . 

It is trivial to verify (a)-(f) for k = 0. So let < I < N A N' £ , and suppose that 
(a)-(f) have been verified for all k < I. We have three cases to consider: 



Case 1: / - 1 and / £ {N' je }: 

In this case, verifying (a)-(f) for k = I is a relatively straightforward application 
of the machinery developed in Subsection 14.3.21 because for t[_ l £ <t<t' le , y e (t) 
traces out the billiard orbit between x' l _ le and x' le corresponding to free flight in 
the domain T>i(Qq). We make only two remarks. 

First, as long as e is sufficiently small, it really is true that x' le = y £ (t' l£ ) 
corresponds to a true collision point on &Di(Qq). Indeed, if this were not the 
case, then it must be that Q £ (t' le ) > Q , and y e (t'i e ) would have to correspond 
to a collision with the side of the "tube" to the right of Q . But then x" £ = 
F x 'i-i £ ^ ^h would correspond to a collision with an immobile piston at Qo and 
would satisfy d(xk,x'l e ) < const p( const /•y) k < const p{ const /^) x = 0(7), using 
Equations (14.161) and (14.231) . But x^ ^ Af y (d£lh ), and so it follows that when the 
trajectory of y e (t) crosses the plane {Q = Qo}, it is at least a distance ~ 7 away 
from the boundary of the face of the piston, and its velocity vector is pointed 
no closer than ~ 7 to being parallel to the piston's face. As Q £ (t' le ) — Q = 
0(sL) = 0(7), and it is geometrically impossible (for small e) to construct a right 
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triangle whose sides si, s 2 satisfy >~ 7, |s 2 | < 0(eL), with the measure of 
the acute angle adjacent to Si being greater than ~ 7, we have a contradiction. 
After crossing the plane {Q = Qo}, y £ (t) must experience its next collision with 
the face of the piston, which violates the fact that I ^ {iVj ]■. 

Second, t\ £ — t' l _ 1 £ = (x' l _ 1 £ + 0(eL), because Vi >e = V\ t o + 0(eL). See Equa- 
tion fTOUD . From Equation EH |Ox/-i ~ C^_i, e | < °((p/l) (const/7)^ 1 ). As 
ti — ti-x = C x i-i an d sL = 0((p/'j) (const/7)^ 1 ), we obtain (f). 

Case 2: There exists % such that I = N' ie : 

For definiteness, we suppose that Q e {t'i E ) > Qo, so that the left gas particle collides 
with the piston to the right of Q . The case when Q £ (t' l e ) < Q can be handled 
similarly. 

We know that xi-i, xi, xi + i ^ J\f 7 (dQh ) U A/' 7 (F~ 1 A/' 7 (9r2/ l0 )). Using the induc- 
tive hypothesis and Equation (14. 16ft . we can define 

and d(xi,Xi e ) < const p(const/7)', d(xi+i, x" +1 e ) < const p(const/7) i+1 . In partic- 
ular, x'( £ and x'l +l e are both a distance ~ 7 away from dQh ■ Furthermore, when 
the left gas particle collides with the moving piston, it follows from Equation (14.81) 
that the difference between its angle of incidence and its angle of reflection is 
0(e). Referring to Figure I4.3[ this means that ip' le = ip" £ + 0(e). Geometric 
arguments similar to the one given in Case 1 above show that the ?/ £ -trajectory 
of the left gas particle has precisely one collision with the piston and no other 
collisions with the sides of the gas container when the gas particle traverses the 
region Q < Q < Q £ (t' l£ ). Note that x' le was defined to be the point in the 
collision cross-section corresponding to the return of the y e -trajectory into 
the region Q < Q Q . See Figure 14.31 From this figure, it is also evident that 
d{r[ e ,r" e ) < 0(eL/j). Thus d(x" £ ,x' l£ ) = 0(eL/j), and this explains the choice 
of p{e) in Equation (J4J22]). 

From the above discussion and the machinery of Subsection I4.3.2[ (a)-(e) now 
follow readily for both k = I and k = I + 1. Furthermore, property (f) follows in 
much the same manner as it did in Case 1 above. However, one should note that 
t' l>£ -ti_ ljE = (xl^ + Oie^ + OieLM and^-t^ = C^ + 0(eL) + 0(eL/-y), 
because of the extra distance G(eLH) that the gas particle travels to the right of 
Q . But eL/^j = 0{{p/^j) (const/7)^ 1 ), and so property (f) follows. 

Case 3: There exists i such that I — 1 = N[ £ : 

As mentioned above, the inductive step in this case follows immediately from our 
analysis in Case 2. 

□ 
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Vi(Qo) 



r— coordinate 




0{eL) 

Figure 4.3: An analysis of the divergences of orbits when e > and the left 
gas particle collides with the moving piston to the right of Q . Note that the 
dimensions are distorted for visual clarity, but that eL and eL/j are both 0(7) as 
e -> 0. 

Furthermore, </?" £ G (-7r/2 + 7/2,7r/2 - 7/2) and y^ £ = </?" £ + 0(e), and so 
r / e = r ie + 0( £ L/l)- In particular, the y e -trajectory of the left gas particle has 
precisely one collision with the piston and no other collisions with the sides of the 
gas container when the gas particle traverses the region Q < Q < Q £ (t' l £ ) 
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4.4 Generalization to a full proof 
of Theorem 14.1.1 



It remains to generalize the proof in Sections 14.21 and !4.3l to the cases when m, ri2 > 
1 and d = 3. 

4.4.1 Multiple gas particles on each side of the piston 

When d = 2, but n\,n2 > 1, only minor modifications are necessary to generalize 
the proof above. As in Subsection I4.2.2[ one defines a stopping time T £ satisfying 

P |t £ < T A T £ | = 0{e) such that for < t < f e /e, gas particles will only 
experience clean collisions with the piston. 
Next, define H(z) by 



H(z) = 







W 








=Q ~ 






-2W 


K\ 






+2W 







It follows that for < t < f £ /e, h e (t)-h B (0) = 0(e)+e jj H(z £ {s))ds. From here, 
the rest of the proof follows the same steps made in Subsection 14.3.11 We note 
that at Step 3, we find that H(z) — H(h(z)) divides into n\ + n 2 pieces, each of 
which depends on only one gas particle when the piston is held fixed. 

4.4.2 Three dimensions 

The proof of Theorem 14. 1.11 in d = 3 dimensions is essentially the same as the proof 
in two dimensions given above. The principal differences are due to differences in 
the geometry of billiards. We indicate the necessary modifications. 

In analogy with Section 14.2.11 we briefly summarize the necessary facts for the 
billiard flows of the gas particles when M = oo and the slow variables are held 
fixed at a specific value h £ V. As before, we will only consider the motions of one 
gas particle moving in "D\. Thus we consider the billiard flow of a point particle 
moving inside the domain T>\ at a constant speed \/2E[. Unless otherwise noted, 
we use the notation from Section 14.2.11 

The billiard flow takes place in the five-dimensional space Ai 1 = {{qi,Vi) G 
TV i : qi G Vi, \v±\ = y/2Ei}/ ~. Here the quotient means that when q± G dV±, 
we identify velocity vectors pointing outside of V\ with those pointing inside V\ 
by reflecting orthogonally through the tangent plane to &V\ at q± . The billiard 
flow preserves Liouville measure restricted to the energy surface. This measure 
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has the density d\i = dqidvi/(8nE 1 \T>i\). Here dq 1 represents volume on R 3 , and 
dvi represents area on S^et = { Vl e ^ 3 • \ Vl \ = V^Ei}- 

The collision cross-section fl = {(?i,fi) G TT>! : g x G |t>i| = y / 2E 1 }/ ~ is 

properly thought of as a fiber bundle, whose base consists of the smooth pieces of 
dT>i and whose fibers are the set of outgoing velocity vectors at q± G &D\. This and 
other facts about higher- dimensional billiards, with emphasis on the dispersing 
case, can be found in [BCST03J. For our purposes, fl can be parameterized as 
follows. We decompose &D\ into a finite union U^Tj of pieces, each of which 
is diffeomorphic via coordinates r to a compact, connected subset of R 2 with 
a piecewise C 3 boundary. The Tj are nonoverlapping, except possibly on their 
boundaries. Next, if (<7i,i>i) G fl and v\ is the outward going velocity vector, let 
v = v\j \v\\. Then fl can be parameterized by {x = (r,v)}. It follows that fl it is 
diffeomorphic to U 3 T 3 - x S 2+ , where S 2+ is the upper unit hemisphere, and by dfl 
we mean the subset diffeomorphic to (UjdTj x S 2+ ) U(Uj r j x dS 2+ ). If x G fl, 
we let <f G [0, 7r/2] represent the angle between the outgoing velocity vector and 
the inward pointing normal vector n to &Di, i.e. cosy? = (v,n). Note that we no 
longer allow (p to take on negative values. The return map F : Q O preserves the 
projected probability measure u, which has the density dv = cosipdv dr/(7T \&Di\). 
Here | &D\ \ is the area of &D\ . 

F is an invertible, measure preserving transformation that is piecewise C 2 . 
Because of our assumptions on V 1 , the free flight times and the curvature of &D\ 
are uniformly bounded. The bound on given in Equation ( 14. 2ft is still 

true. A proof of this fact for general three-dimensional billiard tables with finite 
horizon does not seem to have made it into the literature, although sec [BCST03] 
for the case of dispersing billiards. For completeness, we provide a sketch of a 
proof for general billiard tables in Section 14.61 

We suppose that the billiard flow is ergodic, so that F is ergodic. Again, we 
induce F on the subspace Cl of Q corresponding to collisions with the (immobile) 
piston to obtain the induced map F : Cl O that preserves the induced measure v. 

The free flight time £ : Q — > R again satisfies the derivative bound given in 
Equation (14.31) . The generalized Santalo's formula [Che97j yields 

EuC 



4|£>i 




\vi\ 





If ( : Cl — > R is the free flight time between collisions with the piston, then it 
follows from Proposition 14.5.11 that 

% 4|X>i| 
EoC 



\v x \l 

The expected value of tin when the left gas particle collides with the (immo- 
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bile) piston is given by 



^/2,E\ r r 2 

Ep\vi\ = Epy/2EiCD8(p = // cos 2 (pdvi = \f2E\-. 

7T JJ S 2+ 3 

As a consequence, we obtain 
Lemma 4.4.1. For fx — a.e. y G A4 1 , 

lira \ I K( s )\^Hs)=Q ds -s^M- 

Compare the proof of Lemma 14.2. 1[ 

With these differences in mind, the rest of the proof of Theorem 14.1.11 when 
d = 3 proceeds in the same manner as indicated in Sections 14.21 14.31 and 14.4.11 
above. The only notable difference occurs in the proof of the Gronwall-type in- 
equality for billiards. Due to dimensional considerations, if one follows the proof 
of Lemma [4.3.11 for a three-dimensional billiard table, one finds that 

uAf^F^Af^dQ)) = 0( 7 1 " 4a + 7 a ). 

The optimal value of a is 1/5, and so vN^F^N^dVt)) = C( 7 1 / 5 ) as 7 -> 0. 
Hence i/C 7jA = 0(X^ 5 ), which is a slightly worse estimate than the one in Equa- 
tion (I4.19p . However, it is still sufficient for all of the arguments in Section f4.3.2[ 
and this finishes the proof. 



4.5 Inducing maps on subspaces 

Here we present some well-known facts on inducing measure preserving transfor- 
mations on subspaces. Let F : (O, 03, v) be an invertible, ergodic, measure 
preserving transformation of the probability space Q endowed with the cx-algebra 
05 and the probability measure v. Let Cl e 23 satisfy < uCl < 1. Define 
R : Cl — ► N to be the first return time to Cl, i.e. Ruj = inf{n G N : F n uj G &}. 
Then if v := u(- n Cl)/vti and 23 := {B n Cl : B G 23}, F : (Cl, 23, u) O defined 
by Fuj = F Ruj uj is also an invertible, ergodic, measure preserving transforma- 
tion [Pet83j . Furthermore E„R = j^Rdv = (uCl)- 1 . 

This last fact is a consequence of the following proposition: 

Proposition 4.5.1. If C, : Vt — > M>o is in L l (v), then Q = J2n=o C°E n is in L 1 ^), 
and 

EoC = Xe„(. 
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Proof. 



„ R—l „ R— 1 oo „ fe— 1 

/ V C o F n dv = / V C o F n dv = V / V C o F n dz/ 

./fi n=0 7n n=0 k=l Jnn{R=k} n=0 

oo fc— 1 „ „ 

= EE / . cdu= / c*>, 

fe=1 n=0 JF"(nn{R=k}) Jn 
because {F n (Q fl {R = k}) : < n < k < oo} is a partition of Q. 



□ 



4.6 Derivative bounds for the billiard map 
in three dimensions 

Returning to Section 14.4.21 we need to show that for a billiard table V>\ C M 3 with 
a piecewise C 3 boundary and the free flight time uniformly bounded above, the 
billiard map F satisfies the following: If xq dQ U F~ l (dQ), then 

, „ ,. const 
\\DF(x )\\ < 



coscp(Fxo) 



Fix xq = (r ,v ) G Q, and let X\ = (ri,vx) = Fx . Let E be the plane that 
perpendicularly bisects the straight line between r and r 1; and let r ly / 2 denote 
the point of intersection. We consider E as a "transparent" wall, so that in a 
neighborhood of Xq, we can write F = F 2 o F x . Here, Fi is like a billiard map in 
that it takes points (i.e. directed velocity vectors with a base) near xq to points 
with a base on E and a direction pointing near r\. (Fi would be a billiard map if 
we reflected the image velocity vectors orthogonally through E.) F 2 is a billiard 
map that takes points in the image of F\ and maps them near x\. Let X\ii = 
F lXo = F^xl Then ||£>F(z )|| < P*i(zo)|| ||£>F a (x 1/2 )||. 

It is easy to verify that ||DFi(xo)|| < const, with the constant depending 
only on the curvature of &D\ at tq. In other words, the constant may be chosen 
independent of Xq. Similarly, ||Di 71 2 _1 (xi) || < const. Because billiard maps pre- 
serve a probability measure with a density proportional to cosy?, detZ^F 2 _1 (xi) = 
cos^i/ cos^i/2 = cos<£>i. As Q is 4-dimensional, it follows from Cramer's Rule for 
the inversion of linear transformations that 

i, np/ , const ||DF 2 _1 (xi)|| 3 const 
\\DFo £1/2 < t, — < 



det-DF 2 1 (x 1 ) cosy?i 



and we are done. 
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